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EXECUTIVE  SUMMARY 
AND  INTRODUCTION 
I.  BACKGROUND 
A.   CONGRESSIONAL  MANDATE.     In  the  Social  Security  Amendments  of  1983, 
Congress    instructed    the    Secretary    of    the    Department    of    Health    and    Human 
Services  to  report  on  "the  appropriateness  of  the  factors  used.. .to  compensate 
hospitals    for    the    additional    expenses   of   outlier   cases,   and   the   application   of 
severity     of     illness,     intensity     of      care,      or      other      modifications     to      the 
diagnosis-related  groups,  and  the  advisability  and  feasibility  of  providing  for  such 
modifications."  (P.L.   98-21,  section   603(a)(2)(C)(iii)).      This   report   establishes   a 
conceptual  framework  for  considering  these  issues,  provides  a  summary  of  research 
conducted  to  answer  the  question,  and  presents  options  for  future  research. 
3.   OUTLIERS,    SEVERITY,    INTENSITY,    AND    THE    PROSPECTIVE    PAYMENT 
SYSTEM.    Medicare's  Prospective  Payment  System  (PPS)  controls  Medicare  outlays 
and  provides  incentives  for  efficiency  by  determining  in  advance  the  amount  which 
Medicare  will  pay  a  hospital  for  a  hospitalization  or  "case".    Because  cost  per  case 
differs  among  disorders  and  among  patients  with  similar  disorders,  payments  to 
each  hosptial  must  be  adjusted  to  reflect  the  "case  mix"  of  that  particular  hospital. 
The  accuracy  of  this  process  of  case  mix  adjustment  is  critical  to  the  equity  and 
long  term  success  of  PPS. 

When  the  Administration  and  Congress  designed  PPS,  research  had  shown  that 
five  variables  had  a  measurable  influence  on  how  a  hospital's  average  cost  per 
admission  compared  with  that  of  other  hospitals.  The  designers  of  PPS  constructed 
the  systeiTi  so  that  it  would  determine  payments  based  on  four  of  these  variables: 

Case  Mix  Index  (CMlKthe  average  resource  requirements  of  a  case  as 

measured  by  diagnosis-related  groups(DRGs)), 

Labor  Costs  (measured  by  the  Area  Wage  Index), 
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Urban/Rural    Location    (measured    by    the    size    of    the    Metropolitan 

Statistical  Area,  if  any,  in  which  the  hospital  is  located),  and 

Teaching  Intensity  (measured  by  the  number  of  interns  and  residents  per 

hospital  bed). 
The  designers  of  PPS  knew  that  a  fifth  variable. 

Hospital  size  (measured  by  number  of  beds) 
also  predicted  variation  in  cost  per  case,  but  they  decided  against   determining 
payments  on  the  basis  of  hospital  size. 

Table  ES.IA  shows  that  these  five  factors  can  explain  about  69-72  percent  of 
variation  in  average  operating  cost  per  case  among  hospitals.  As  noted,  PPS  does 
not  consider  hospital  size  in  determining  payments;  Table  ES.IB  shows  that  case 
mix  index,  labor  costs,  urban/rural  location,  and  teaching  intensity  without  hospital 
size  explain  66-69  percent  of  variation.  PPS  does  not,  for  reasons  explained  in  the 
report  (Chapter  ^.I)  use  case  mix  index,  labor  costs,  urban/rural  location,  and 
teaching  intensity  in  a  way  which  most  accurately  simulates  current  costs;  Table 
E5.1C  shows  that  when  these  factors  are  used  in  the  way  that  PPS  was  actually 
implemented,  they  explain  6^-66  percent  of  variation.  The  fact  that  the  payment 
system  accounts  for  only  two  thirds  of  variation  raises  a  question  whether  the 
remaining  third  is  due  to  factors  for  which  Medicare  should  not  pay  or  whether  the 
system  is  imprecise.  Four  factors  ar-^  thought  to  account  for  most  of  the 
remaining  variation: 

Outliers  (cases  so  unusual  that  they  were  not  properly  described  by  the 

DRG  in  which  they  were  classified). 

Severity  of  Illness  (defined  as  the  variation  in  the  clinical  illness  within  a 

DRG). 

Intensity  of  Care  (defined  as  variation  in  care  for  patients  in  the  same 

DRG   with  the   same   clinical   severity  of   illness).      Some   variation   in 


intensity    of    care     is     medically     appropriate    while    some    variation 
constitutes  either  wasteful  or  inappropriate  care). 

Efficiency    (inefficiency    is    obvious    when    the    cost    of    providing    an 

individual  service  is  unnecessarily  high;  inefficiency  and  intensity  of  care 

snay  be  difficult  to  distinguish  if  the  need  for  a  service  is  questionable). 

Satisfactory  measures  were  not  available  for  efficiency  or  for  variations  in 

severity  and  intensity  within  DRGs.    Outliers  could  be  identified,  but  estimates  of 

their  importance  were  not  available.    The  design  of  PPS  protected  hospitals  against 

these    uncertainties   by    several    cushioning   devices,    notably    a   transition   period 

before  hospitals  would  be  paid  wholly  under  a  national   DRG-based  system.      In 

addition.  Congress  asked  for  this  report  before  the  end  of  the  transition  period. 

Thus,   the   congressional   mandate   to   report   on   outliers,   severity,   and   intensity 

requires   examining   the   unexplained   variation   in   average   cost   per   case   among 

hospitals. 

C.  PPS;  INCENTIVES,  EQUITY,  AND  PROTECTION.  We  have  already  indicated 
that  many  payment  factors  other  than  the  DRG  and  outlier  systems  provide  case 
mix  adjustment.  The  corrections  for  teaching  activity,  hospital-specific  and 
regional  costs,  and  urban/rural  location  are  implicitly  corrections  for  variations  in 
severity  of  illness  and  intensity  of  care.  Thus,  we  can  only  evaluate  the  DRG  and 
outlier  systems,  and  the  importance  of  unmeasured  variation  of  severity  and 
intensity,  as  part  of  the  overall  payment  system.  Three  concepts  are  important  for 
this  evaluation: 

1.  Incentives.  The  key  incentives  for  efficiency  in  PPS  are  that  payment  is 
determined  before  services  are  rendered  and  that  the  unit  for  payment  is  the 
hospitalization,  a  large  "bundle"  of  individual  services.  Available  data  indicate 
that         such         incentives         are         strong         even         if         hospitals         are 


cushioned  by  various  adjustments  and  that  these  incentives  are  almost  independent 
of  whether  rates  are  national,  regional,  or  even  partly  hospital-specific.  The 
Secretary's  Report  on  the  Impact  of  the  Medicare  Prospective  Payment  System  for 
1984  described  dramatic  hospital  responses  to  these  incentives  in  the  first  year  of 
PPS,  including  large  decreases  in  length  of  stay,  staffing,  and  costs.  These 
responses  make  clear  that  the  incentives  were  very  effective  even  when  payments 
were  largely  based  on  historical  hospital  costs. 

2.  Equity.  Case  mix  adjustment  must  achieve  two  kinds  of  equity:  it  must 
pay  hospitals  an  amount  which  covers  the  cost  of  efficiently  providing  necessary 
care  of  Medicare  beneficiaries  and  it  must  pay  in  a  way  which  does  not  give 
hospitals  an  incentive  to  discriminate  against  any  class  of  beneficiaries  or  against 
beneficiaries  with  any  particular  clinical  condition. 

If  PPS  does  not  pay  some  hospitals  on  average  enough  to  provide  medically 
appropriate  care,  these  hospitals  may  face  unfair  financial  hardship  and  possible 
insolvency;  beneficiaries  may  face  decreased  quality  of  care  or  difficulty  in 
securing  care.  Unfairness  will  occur  if  outliers,  severely  ill  cases,  or  more  intense 
care  concentrate  in  certain  hospitals,  in  a  way  which  is  not  otherwise  recognized 
by  the  payment  system. 

This  report  focuses  on  the  ability  of  the  PPS  case  mix  adjustment  system  to 
achieve  both  kinds  of  equity.  A  key  feature  of  equity  to  hospitals  is  that  payments 
need  not  be  accurate  for  each  admission,  but  they  must  be  equitable  on  average. 
On  the  other  hand,  equity  to  patients  requires  that  a  hospital  not  be  able  to 
identify  in  advance  that  any  patient  will  need  more  care  than  Medicare  will  pay 
for.   Such  beneficiaries  would  be  targets  for  discrimination  and  inadequate  care. 


3.  Protection.  The  teaching  and  urban/rural  adjustments  were  designed  to 
cushion  urban  and  teaching  hospitals  against  large  changes  in  payments.  To  guard 
against  possible  inequities,  Congress  also: 

enacted  provisions  for  special  payments  for  outlier  cases, 

doubled    the    Department's   proposed    adjustment    for   indirect   costs  of 

teaching  from  5.79  percent  to  11.58  percent, 

created   a  transition  period  during   which  hospital  payments  would  be 

based  in  part  on  the  hospital's  historical  costs  and  on  regional  rather  than 

national  payment  rates, 

excluded    certain   types   of   psychiatric,    rehabilitation,    long-term,   and 

children's  facilities, 

provided    for    special    treatment    of    cancer    hospital,    sole    community 

hospitals,  rural  referral  centers,  and  hospitals  with  a  "disproportionate" 

share  of  poor  and  Medicare  patients,  and 

required    this    report    on    the    appropriateness   of   the    outlier   payment 

system,   as   well  as   on   the   severity   and    intensity    issues,    before    the 

transition  period  was  scheduled  to  expire. 

Congress  also  inacted   higher   PPS   updates   than   the  Secretary  deems 

necessary  for  efficient  and  quality  health  care. 
Hospital-specific,  regional,  urban/rural,  and  teaching  adjustments  were  all 
designed  to  cushion  hospitals  from  undue  dislocation  as  PPS  was  put  in  place.  To 
the  extent  that  any  of  these  "cushions"  is  removed  (and  the  law  schedules  hospital- 
specific  and  regional  adjustments  to  phase  out  by  fiscal  year  1988),  accurate  case 
mix  adjustment  becomes  increasingly  critical  to  the  long-term  viability  of  PPS. 

The  key  issue  in  continued  refinement  of  PPS  is  preserving  incentives  without 
endangering  either  the  system's  equity  or  its  viability. 


C.  PREOICTING  COST  PER  CASE.  Because  the  components  of  case  mix 
adjustment  are  interdependent,  compromises  or  imprecision  in  measuring  or 
applying  one  component  may  influence  the  apparent  accuracy  of  another.  Several 
such  compromises  were  made  in  the  design  of  PPS.  Although  CMI,  labor  cost, 
urban/rural  location,  teaching  intensity,  and  hospital  size  can  explain  72  percent  of 
variation  in  average  cost  per  case,  some  compromises  were  made  in  the  design  of 
PPS  that  reduce  its  precision  below  that  level  (compare  Tables  ^.lA  and  ^.IC  and 
see  Chapter  ^.I). 

1.  Labor  Cost.  Hospital  cost  per  case  rises  almost  exactly  1  percent  for  every 
1  percent  increase  in  the  wage  index,  but  PPS  allows  payments  to  change  only  0.79 
percent  because  labor  costs  contribute  directly  to  only  79  percent  of  hospital  costs. 

2.  Urban/Rural  Location.  The  Department's  original  calculations  (Pettengill 
and  Vertrees  19S2)  used  four  levels  of  urbanicity  (from  large  city  to  rural),  but  PPS 
distinguishes  only  between  urban  and  rural  locations  in  setting  standard  payment 
amounts.  Moreover,  PPS  treats  all  rural  areas  in  a  State  as  a  single  area  in  setting 
wage  adjustments. 

^'  Hospital  Size.  Research  shows  that  average  cost  per  case  increases  with 
hospital  size,  but  conventional  economic  theory  suggested  that  this  increase  was 
due  to  inefficiency.  Paying  more  to  large  hospitals  also  appeared  to  be  bad  public 
policy  in  a  health  care  industry  which  appeared  to  have  an  excess  of  beds.  The 
designers  of  PPS,  therefore,  decided  not  to  adjust  payment  per  case  for  hospital 
size  because  such  an  adjustment  might  encourage  inefficiency  and  reward 
construction  of  unneeded  beds. 


^.  Hospital  Size  and  Teaching  Activity.  When  hospital  size  was  not 
considered,  1981  data  showed  that  hospital  average  cost  per  discharge  increased 
8.^5  percent  (Table  £S.l)  for  every  tenth  of  an  intern/resident  per  bed;  when  the 
hospital  size  was  considered,  the  increase  was  5.795  percent.  Congress  doubled  the 
latter  figure  in  setting  a  payment  adjustment  of  11.6  percent;  the  Consolidated 
Omnibus  Budget  Reconciliation  Act  of  1985  reduced  this  adjustment  to  8.1-8.7 
percent. 
D.   THE  NATURE  OF  OUTLIERS,  SEVERITY,  COMPLEXITY,  AND  EFFICIENCY. 

1.  Outliers.  The  DRG  classification  system  was  not  designed  to  deal  with 
unusual  cases.  The  fact  that  there  are  only  ^68  DRGs  means  that  uncommon  cases 
cannot  be  classified  into  a  group  containing  similar  cases.  The  concept  of  outliers 
and  outlier  cutoffs  was  basic  to  the  design  of  the  DRGs  because  DRGs  were 
designed  to  identify  outliers,  but  not  to  explain  their  high  costs.  Congress, 
therefore,  instructed  the  Department  to  set  aside  5-6  percent  of  total  PPS 
payments  to  pay  for  the  "marginal  cost"  of  "outliers"  and  defined  outliers  as 
patients  with  either  unusually  long  stays  or  unusually  high  costs.  Regulations 
defined  length-of-stay  and  cost  thresholds  and  defined  marginal  cost  as  60  percent 
of  the  average  per  diem  cost  for  nonoutlier  patients  for  each  day  beyond  the 
outlier  threshold,  and  as  60  percent  of  the  difference  between  the  calculated  cost 
and  the  cost  threshold  amount  for  cost  outliers.  There  was  little  research  basis  for 
these  decisions.  Thus,  the  congressional  mandate  to  examine  outlier  payments 
requires  an  examination  of  the  impact  of  the  outlier  policy  under  actual  PPS 
conditions  and  a  study  of  the  relation  of  outlier  payments  to  other  components  of 
case  mix  adjustment. 


2.  Severity  of  Illness.  Severity  of  illness  is  the  patient's  clinical  condition  at 
admission  as  indicated  by  the  clinical  likelihood  of  adverse  outcomes  such  as  death 
or  disability.  Severity  is  a  somewhat  ambiguous  term  because  it  includes  both 
short-term  prognosis  (as  in  a  pneumonia)  and  long-term  prognosis  (as  in  cancer), 
and  because  the  severity  of  an  illness  for  a  patient  includes  both  the  illness  itself 
and  the  general  health  of  the  patient.  The  DRG  system  measures  severity  of 
illness  both  directly,  by  the  diagnosis  recorded  and  by  the  presence  of 
comorbidities  and  complications,  and  indirectly,  through  the  patient's  age  and  the 
specific  procedures  recorded.  The  Health  Care  Financing  Administration  (HCFA) 
has  spent  more  than  $10  million  on  severity  research.  The  mandate  to  report  on 
severity  modifications  to  DRGs  comprises  determining  whether  suitable 
adjustments  to  DRGs  are  available  and  advisable,  what  impact  such  adjustments 
might  have,  and  how  valid  the  case  mix  adjustment  system  may  be  without  severity 
adjustments  if  they  are  not  feasible. 

3.  Intensity  of  Care.  Intensity  of  care  refers  to  the  amount  of  medical  care 
provided  to  treat  patients  in  the  same  DRG  who  have  similar  severity  of  illness. 
Medically  appropriate  variation  in  intensity  is  extremely  difficult  to  distinguish 
from  inefficient  and  unnecessary  proliferation  of  services.  Nevertheless,  the 
distinction  is  critical  for  payment  purposes  because  Medicare  seeks  to  pay  for 
effective  care  and  discourage  inefficient  and  unnecessary  care.  This  report 
focuses  on  appropriate  variations  in  intensity  and  assumes  that  variations  which  are 
not  appropriate  are  manifestations  of  inefficiency. 

Intensity  reflects  factors  such  as  how  much  diagnostic  effort  is  necessary, 
whether  a  disease  will  be  treated  during  one  or  several  admissions,  how  intense  an 
effort  will  be  made  to  preserve  life,  and  whether  dying  patients  are  treated  in  a 


hospital  setting.  The  mandate  to  report  on  intensity  adjustments  to  DRGs  should, 
therefore,  include  an  estimate  of  the  importance  of  these  factors,  an  evaluation  of 
any  available  ways  to  quantify  them,  and  an  assessment  of  the  adequacy  of  the 
case  mix  adjustment  system  in  light  of  the  intensity  problem. 

4.  Efficiency.  Efficiency  is  the  relative  cost  of  delivering  a  given  level  of 
medically  appropriate  and  efficacious  care  to  a  patient.  Inefficiency  may  result 
from  providing  excessive  services,  from  producing  individual  services  in  an 
unnecessarily  costly  manner,  or  from  providing  services  in  an  ineffective  way.  The 
designers  of  PPS  believed  that  hospitals  were  not  equally  efficient  and  that 
incentives  for  efficiency  were  necessary  to  change  this  situation.  They  also 
realized  that  variations  in  cost  that  were  unexplained  by  the  PPS  case 
measurement  system  might  be  attributable  to  factors  other  than  inefficiency. 

These  definitions  provide  a  framework  for  interpreting  a  series  of  research 
findings  that  were  not  available  when  Congress  established  PPS  or  when  Medicare 
designed  the  outlier  payment  system.  These  findings  result  both  from  direct 
experience  with  hospital  operations  under  PPS  and  from  research  conducted 
primarily  to  support  this  report. 

II.   ANALYTIC  FINDINGS 
A.   OUTLIERS  ARE  A  KEY  TO  PAYMENT  EQUITY.     Analysis  of  the  distribution 
and  impact  of  outlier  payments  shows  that  outliers  play  a  critical  role  in  the  equity 
of  PPS  (Chapter  4.VI). 

1.  Outliers  account  for  the  majority  of  the  difference  between  charges  and 
payments  under  PPS.  In  the  first  year  of  PPS,  2.96  percent  of  cases  were  classified 
as  outliers,  but  those  cases  accounted  for  56.02  percent  of  the  difference  between 


hospital  charges  and  Medicare  payments  (Table  ES.3).    This  finding  suggests  that 
more  accurate  outlier  payments  might  improve  the  precision  of  PPS. 

2.  Outliers  have  very  high  costs,  high  charges,  and  high  losses.  Outlier  cases 
are  very  expensive:  2.96  percent  of  cases  account  for  17.19  percent  of  estimated 
operating  costs  and  17.66  percent  of  Medicare  covered  charges,  but  only  6.35 
percent  of  payments  (Table  ES.3).  Average  estimated  costs  for  outliers  were 
$16,819,  which  is  6.80  times  the  average  nonoutlier  cost  of  $2,473.  The  average 
payment  for  an  outlier  case  was  $7,248,  resulting  in  an  estimated  loss  of  $9,573. 
This  loss  contrasts  with  an  estimated  profit  of  $791  for  each  nonoutlier  case 
(Table  ES.3).  This  finding  suggests  that  payments  for  outliers  could  be  increased 
without  endangering  the  incentives  to  reduce  costs. 

3.  Outliers  appear  to  have  more  severe  illness.  Outlier  cases  appear  to  be 
sicker  than  nonoutliers,  not  only  when  their  total  illness  is  considered,  but  also  in 
proportion  to  their  longer  stays  (Table  ES.3).  Routine  daily  room  charges  are  38.0 
percent  of  the  bill  for  the  typical  patient  and  21.3  percent  for  outliers;  intensive 
care  charges  are  6.36  percent  of  typical  bills  and  11.76  percent  of  outlier  charges; 
and  ancillary  charges  are  55.7  percent  of  charges  for  nonoutliers  and  66.9  percent 
for  outliers.  This  finding  suggests  that  average  daily  costs  for  stay  outliers  are 
probably  more  than  60  percent  of  average  daily  costs  for  typical  cases. 

4.  PPS  identifies  stay  and  cost  outliers  in  similar  ways  but  pays  more  for  cost 
outliers.  Despite  the  very  large  difference  in  how  stay  and  cost  outliers  are 
identified,  total  charges  are  rather  similar  for  the  two  groups:     $26,151  for  stay 
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outliers  and  $26,012  for  cost  outliers,  as  were  estimated  costs  ($17,116  v.  $16,385). 
But  PPS  paid  an  estimated  51.3  percent  of  costs  for  cost  outliers  and  37.7  percent 
of  costs  for  stay  outliers  (Table  ES.3).  This  finding  suggests  that  the  distinction 
between  cost  and  stay  outliers  may  penalize  hospitals  when  a  very  expens.ve 
patient  stays  long  enough  to  be  paid  as  a  stay  outlier. 

5.  PPS  paid  less  than  planned  under  the  outlier  payment  system.  The 
Department  set  an  initial  payment  target  of  5J  percent  of  total  Medicare 
payments.  Later,  however,  with  industry  approval,  the  Department  restricted 
outlier  payments  to  the  portion  of  payments  which  were  not  based  on  the  hospital's 
historical  costs,  because  historical  costs  included  the  hospital's  tendency  to  have 
outlier  patients;  this  lowered  the  target  to  2.1  percent.  As  the  198^  Report  on  The 
Impact  of  the  Medicare  Prospective  Payment  System  noted,  actual  payments  under 
the  outlier  system  were  about  53  percent  of  the  target.  This  discrepancy  resulted 
from  large  decreases  in  average  length  of  stay  when  hospitals  came  under  PPS, 
which  reduced  the  number  of  patients  staying  long  enough  to  become  stay  outliers. 
Outlier  thresholds  were  changed  in  1985  to  increase  payments,  but  legislation 
delayed  these  changes  until  May  1986.  The  decreasing  role  of  the  hospital-specific 
portion  in  payments  will  also  increase  outlier  payments  per  case.  Available  data  do 
not  show  whether  a  change  in  the  legislated  target  is  appropriate. 

6.  Outliers  are  concentrated  in  large,  urban,  and  teaching  hospitals.  Outlier 
cases,  as  well  as  the  corresponding  difference  between  charges  and  payments,  were 
concentrated  in  large,  urban,  and  teaching  hospitals.  This  concentration  of  high 
risk    cases   is    more   than   offset    by    the    urban/rural    and    teaching    adjustments. 
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Because    urban/rural    and    teaching    adjustments    have    been    calculated    without 

considering   the   benefits    which    outlier    payments    give    to    urban    and    teaching 

hospitals  the  net  effect  of  outlier  payments  has  been  to  overpay  urban  and  teaching 

hospitals.    The  Omnibus  Budget  Reconciliation  Act  of  1986  (P.L.  99-509)  changed 

the    calculation    of    urban/rural    payments    to    remove    this    effect,    but    outlier 

payments  still  overpay  teaching  hospitals  and  contribute  to  their  high  operating 

margins. 

7.  Paying  "Marginal  Costs"  for  Outliers  is  Impractical  and    Unwise.    Marginal 

cost  is  the  added  cost  resulting  from  services  rendered  after  the  outlier  threshold 

is  reached.    The  congressional  mandate  to  pay  the  actual  marginal  cost  of  outlier 

cases  presents  very  difficult  technical  problems  because  the  date  when  costs  are 

incurred  cannot  be  determined  from  available  data.     Charge  data  from  the  first 

year  of  PPS  as  well  as  econometric  studies  suggest  that  the  estimates  of  marginal 

costs  used  for  implementation  were  too  low,  especially  for  cost  outliers  and  very 

long  stay  day  outliers,  but  precise  alternative  estimates  are  not  yet  possible.     If 

PPS    tried    to    pay   exactly    the    full   marginal   costs   of   care   beyond   the   outlier 

threshold,  the  inevitable     imprecision  of  estimating  actual  marginal  costs  would 

mean  that  about  half  of  the  hospitals  would  actually  make  money  on  outliers  once 

the  threshold  was  crossed.     Further,   Medicare  would  effectively  return  to  cost 

reimbursement    for   outlier   care:      there  would  be  no  incentive  for   hospitals  to 

contain  costs  once  the  outlier  threshold   was  crossed,  and   the   outlier    payment 

system  would  resemble  an  insurance  policy  for  the  hospital  which  did  not  require 

the  hospital  to  pay  coinsurance.     This  risk  is  significant  because  not  all  outlier 

costs   result   from   appropriate   care   for    which   Medicare   should   pay.      HCFA  is 

pursuing  research  on  ways  to  make  payments  closer  to  costs  for   patients  whose 

costs  grossly  exceed  the  current  outlier  payments  such  as  extremely  expensive  long 

stay  cases. 
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&.  Outliers  may  be  an  appropriate  area  in  which  to  experiment  with  severity 
measures.  Because  outlier  patients  do  have  illnesses  of  unusual  severity  and 
because  their  number  is  small,  they  may  be  the  ideal  group  on  which  first  to  apply 
measures  of  severity.  Such  measures  would  be  particularly  valuable  in 
distinguishing  between  outlier  cases  who  entered  the  hospital  with  very  large 
identifitble  needs  for  care  and  those  who  became  outliers  because  of  adverse 
events  during  hospitalization. 

B.  SEVERITY  OF  ILLNESS  IS  AN  IMPORTANT  DETERMINANT  OF  COST.  BUT 
NO  MAJOR  IMPROVEMENT  IN  SEVERITY  MEASUREMENT  IS  NOW  AVAILABLE. 
Since  Congress  enacted  PPS,  research  has  accumulated  on  the  relationship  of 
severity  of  illness  to  cost  of  care.  Severity  of  illness  is  dearly  an  important 
determinant  of  cost  of  care  in  individual  cases  as  shown  in  research  by  Brewster  et 
al.  (19i5b),  Coulter  et  al.  (19S3),  Horn  et  al.  (19^3,  19i.^),  the  National  Association 
of  Cnildren's  Hospitals  (1985),  Wagner  and  Draper  (198^*),  and  others.  The 
pref>onderance  of  evidence  now  indicates  that  severity  is  a  significant  determinant 
of  differences  in  cost  between  hospitals  (Cretin  and  Worthman  1986;  Conklin  et  al. 
19S5a;  Horn  et  al.  1985a,  1985b;  Brewster  unpublished  data).  Refinements  of  the 
DRG  system  appear  to  be  feasible,  but  major  improvements  are  not  yet  available 
(Chapter  3.VI11). 

DRGs  use  discharge  diagnoses,  procedures  perforimed,  patient  age,  and  (rarel>) 
other  information  to  assign  each  discharge  to  one  of  ^68  groups.  Although  DRGs 
measure  severity  of  illness  through  examining  principal  and  secondary  diagnosis, 
patient  age,  and  some  other  variables,  DRGs  do  not  use  physiological  measures  or 
Information  directly  abstracted  from  the  chart  to  determine  severity.  In  contrast 
to  some  measures,  the  DRG  algorithm  and  classification  programs  were  developed 
with  public  funds  and  are  in  the  public  domain.  HCFA  has  examined  five  systems 
lor  replacing  or  refining  DRGs  (Table  ES.k): 
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Disease  Staging  (Staging).  Staging  assigns  each  discharge  diagnosis  to 
one  ol  420  staged  disease  categories  and  to  one  of  four  stages  within  the 
disease  category.  The  stage  assignment  reflects  the  progressiw)  of  the 
disease.  As  a  replacement  for  DRGs,  it  has  less  ability  to  prrtict 
hospital  costs  accurately  (Table  ES.5  and  Coffey  and  Goldfarb  198^). 
Replacing  DRGs  with  Staging  would  discriminate  against  referral 
$urgical  centers  because  Staging  would  make  the  same  payment  whether 
or  not  surgery  was  performed.  The  current  version  of  the  Disease 
Staging  classification  progran  is  proprietary.  HCFA  is  supporting 
Studies  on  using  Disease  Staging  to  reljne  DRGs.  These  studjes  are 
promising,  but  incomplete  (Chapter  3.III). 

Patient  Management  Categories  (P\^Cs).  PMCs  use  discharge  diagnoses 
and  procedures  to  assign  a  patient  to  one  or  more  of  800  management 
categories.  This  system  has  been  proposed  as  a  replacement  for  DRGs, 
but  has  not  been  shown  to  perform  better  (Table  ES.5).  The 
classification  program  is  in  the  p-z!.:  domain  (Chapter  3.V). 
Severity  of  Illness  Index  (SDIl)  and  Computerized  Severity  Index  (CSl). 
SOU  is  a  system  for  assigning  a  severity  score  to  a  patient  at  discharge 
based  on  a  special  absfaction  of  the  mediCal  record.  The  developer  has 
proposed  using  SOU  within  DRGs  to  refine  DRG  payment.  SOU  has  a 
number  of  subjective  elements  which  would  create  major  administrative 
difficulties  in  a  payment  system.  SOU  may  also  tend,  more  than  other 
systems,  to  confound  care  actually  rendered  with  care  needed  both 
because  it  does  not  distinguish  severity  on  admission  from  severity  over 
the  course  of  hospitalization  and  because  it  uses  response  to  treatment 
as  a  measure  of  severity  of   illness.     SOU  has  not    been  tested  on   a 
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representative  sample  of  hospitals,  so  its  payment  impact  is  not 
predictable.  Altboug+i  the  SOn  instrument  is  in  the  public  domain,  the 
training  materials  which  are  critical  to  its  use  are  not  publicly  available. 
CSI  is  a  newer  system,  developed  by  the  same  investigator,  which 
appears  to  be  replacing  SOU.  It  requires  special  abstraction  of  the 
medical  record,  but  the  elements  abstracted  are  combined  by  computer 
to  give  a  score.  CSI  is  not  yet  fully  developed  and  has  had  only  very 
limited  tests  (Chapter  3.VI).  The  proprietary  status  of  CSI  is  not  yet 
clear. 

—  Acute  Physiology  and  Chronic  Health  Evaluation  (APACHE).  This 
physiologically  l)ased  system  assigns  a  patient  a  score  based  on  a  special 
abstracting  of  the  chart.  APACHE  has  been  proposed  to  refine  DRGs, 
but  it  has  rarely  been  used  on  patients  outside  of  intensive  care  units  and 
no  data  are  available  to  predict  its  impact  on  payments  to  hospitals. 
APACHE  is  in  the  public  domain  (Chapter  3.II). 

—  Medical  Illness  Severity  Grouping  System  (MEDISGRPS).  This  system 
uses  a  special  chart  abstract  to  assign  a  severity  score  to  each  patient  on 
admission  to  the  hospital.  MEDISGRPS  is  a  new  system  and  only 
preliminary  data  are  available  on  its  ability  to  predict  hospital  costs. 
MEDISGRPS  is  a  wholly  proprietary  system.  HCFA  is  supporting  further 
studies  of  this  system  (Chapter  3.IV)  at  the  Brandeis  Health  Policy 
Research  Center. 

The  Department  does  not  believe  that  any  system  to  measure  clinical  severity  of 
illness  is  currently  an  administratively  feasible  major  improvement  to  or  substitute 
for  DRGs.  Such  adjustments  are  desirable  for  purposes  of  measuring  quality  —  but 
for  payment  purposes  this  is  less  clear.    A  research  strategy  is  in  place  which  will 
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result  in  evaluations  of  available  severity  measures  over  the  next  3  years  and  may 
identify  severity  measures  that  can  be  incorporated  into  PPS. 
We  are  studying  DRG  refinements  These  include: 

—  more  sophisticated  use  of  the  additional  diagnoses  which  are  already 
avaOaWe  in  the  HCFA  data  system; 

—  more  precise  use  of  age  in  selected  DRGs; 

~       use  of  source  of  admission  for  selected  DRGs;  and 

—  use  of  Medicare  eligibility  status  for  selected  DRGs. 

C.  MEDICALLY  APPROPRIATE  VARIATIONS  IN  INTENSITY  OF  CARE  APPEAR 
TO  BE  AN  IMPORTANT  DETERMINANT  OF  COSTS.  A  growing  body  of  evidence 
suggests  that  medically  appropriate  variation  in  care  provided  to  patients  with 
comparable  severity  of  illness  accounts  for  significant  variation  in  cost  per  case 
among  patients,  between  hospital  types,  and  across  States  and  regions  (Chapters 
2.ni,  4.n  and  4.ni).  Direct  methods  to  measure  these  variations  are,  however, 
much  less  advanced  than  methods  to  measure  severity  of  illness  and  are  not  ready 
for  payment  applications. 

1.  Variation  Among  Patients.  Patients  with  similar  severity  of  illness  and  in 
similar  DRGs  receive  care  which  varies  in  intensity  for  a  number  of  medical 
reasons: 

—  Patients  whose  diagnosis  is  known  before  admission  appear  to  receive 
fewer  services  than  patients  who  must  be  t»th  diagnosed  and  treated 
during  the  admission.  Emergency  admissions  therefore  tend  to  be  more 
expensive  (Munoz  et  al.  1985). 

—  Treating  patients  aggressively  with  a  hope  of  survival  or  cure  is  usually 
more  expensive  than  managing  them  with  a  goal  of  palliating  symptoms 
or  relieving  pain.    Severity  of  illness  is  only  one  of  the  influences  on  how 
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doctor  and  patient  decide  whether  to  take  heroic  measures. 
—       The  treatment  given  patients  who  die  is  much  more  unpredictable  than 
the   treatment   of   patients   who    live,   because    some    die    quickly    with 
modest  costs  while  others  receive  extended  and  heroic  treatment  before 
they  die. 

Some  patients  are   treated  in  a  series  of  admissions  while  others  are 
treated  definitively  in  a  single,  longer  admission. 

In  general,  neither  the  severity  of  illness  nor  the  nature  of  the  illness 

predicts   the   amount   of   care   that    will    be   provided   to    patients    with 

chronic  disease.    Rather,  care  to  these  patients  is  influenced  by  what  has 

been  done  in  the  past,  what  treatments  remain  untried,  how  supportive 

an  environment  the  patient  lives  in,  and  what  patient  and  doctor  decide 

upon  together. 

2.    Differences    Across    Hospital    Types.      Much    of    the    difference    in    costs 

between  urban  and  rural  hospitals,  teaching  and  nonteaching  hospitals,  and  large 

and  small  hospitals  appears  to  reflect  differences  in  what  is  done  for  patients  with 

comparable   progression  of   illness  in   the  same   DRG   (Coffey   and   Goldfarb    19S^, 

Cromwell   1985)(Chapter  ^.III).     One  basis  of  this  conclusion  is  a  series  cjf  studies 

showing  that  patients  in  less  expensive  hospitals  have  disease  progression  similar  to 

those  in  more  expensive  hospitals,  but  that  patients  in  less  expensive  hospitals  are 

less  likely  to  receive  expensive  treatments.    This  conclusion  is  analogous  to  studies 

showing  that  geographic  variation  in  patterns  of  treatment  is  much  greater  than 

epidemiologic  variation  in  underlying  disease. 

Other  evidence  points  in  this  direction.  Fo-  example,  we  have  already  noted 
that  hospital  size  is  significantly  related  to  cost  per  discharge  even  after  adjusting 
for  differences  in  location,  wage  costs,  and  teaching  intensity.    The  effect  of  size 
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on  costs  largely  reflects  the  fact  that  hospitals  with  100-^9  beds  are  17.6  percent 
more  expensive  than  hospitals  with  fewer  than  51  beds  (Table  ES.2).  By  contrast, 
hospitals  with  more  than  399  beds  are  only  6.3  percent  more  expensiye  than 
hospitals  with  100-U9  beds.  The  dramatically  lower  costs  in  very  small  hospitals 
are  most  unlikely  to  reflect  greater  efficiency;  rather,  these  hospitals  probably 
provide  less  intensive  care  for  their  patients  and  transfer  many  patients  with 
complex  illnesses  to  larger  hospitals  (Chapter  '*.VI1). 

These  findings  are  consistent  with  commonly  held  views  that  certain  kinds  of 
hospitals  provide  more  aggressive  and  intensive  care  than  other  hospitals.  For 
example,  many  severely  ill  patients  are  transferred  to  tertiary  care  facilities,  but 
such  transfers  would  appear  to  focus  on  trauma,  burn,  and  neonatal  intensive  care 
units  rather  than  less  specialized  services.  Many  other  patients  are  referred  to 
tertiary  care  facilities  in  early  stages  of  disease  either  because  of  diagnostic 
uncertainty  or  for  aggressive,  technologically  sophisticated  or  curative  treatment 
of  diseases  ranging  from  congenital  malformations  to  cancer.  By  contrast,  there  is 
no  evidence  that  dying  patients  are  more  likely  to  be  admitted  to  tertiary  care 
facilities  than  to  other  hospitals  (Chapter  '♦.II,  ^.IV). 

The  availability  of  advanced  technology,  trained  specialists,  and  experienced 
support  staff  leads  to  provision  of  medically  appropriate  services  in  some  hospitals 
which  are  much  less  available  in  other  hospitals.  Whether  large  nonteaching 
hospitals  are  more  like  community  hospitals  or  like  large  teaching  hospitals 
remains  ambiguous  (Chapter  '♦.VII)  and  may  depend  upon  the  community  standards 
to  which  such  hospitals  are  held  by  physicians  and  patients. 

An  important  result  of  the  very  different  way  teaching  and  nonteaching 
hospitals  care  for  patients  is  that  the  most  profitable  DRGs  in  the  former  are  not 
the    most    profitable    in    the    latter    (Cretin    et    al.    1986).      Knowing   the    relative 
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profitability  of  a  DRG  in  a  teaching  hospital  predicts  less  than  half  of  the  variation 
in  its  relative  profitability  in  a  nonteaching  facility.  This  sugg,ests  that  teaching 
hospitals  are  not  simply  doing  more,  but  practicing  a  different  pattern  of  medicine. 
Available  data  do  not  permit  classifying  more  intense  care  as  either  having  Higher 
quality  or  being  less  efficient. 

3.  Differences  Across  States  and  Regions.  Health  services  appear  to  be 
organized  and  deli/ered  in  ways  which  vary  dramatically  across  States  and  regions, 
and  these  differences  appear  to  affect  costs  (Chapter  '^.III): 

The  number  of  beneficiaries  who  die  within  30  days  of  admission  does  not 
vary  greatly  across  States,  but  the  relative  number  of  beneficiaries  who 
die  inside  or  outside  of  a  hospital  during  that  period  depends  heavily  on 
average  length  of  hospital  stay  in  the  State.  In  States  where  average 
stays  are  short,  more  patients  appear  to  receive  terminal  care  in 
nonhospital  settings  (Table  ES.6). 

The  rate  at  which  surgery  is  performed  varies  widely  across  small  and 

large  regions.     Likewise,  the  rate  of  hospital  admissions  varies  across 

regions.    Areas  in  which  hospital  admission  rates  are  lower  tend  to  have 

higher  costs  per  admission  (Gornick  19S2).     Although  severity  of  illness 

may  account  for  a  portion  of  this  variation,  there  is  no  evidence  that  it 

does  so.    Likewise,  there  is  as  yet  no  evidence  that  appropriate  criteria 

for  admission  can  be  made  sufficiently  clear  to  abolish  these  forms  of 

interstate  variation,  even  though  they  can  almost  certainly  be  reduced. 

D.   PPS   PAYS   APPROPRIATELY   FOR    CARE   TO  VULNERABLE  BENEFICIARY 

GROUPS.     PPS  does  not  appear  to  create  payment  incentives  which  discriminate 

against  vulnerable  beneficiary  groups  such  as  the  poor,  the  very  old,  or  the  disabled 

(Chapter    3.V11).       Charges    for    admissions    of    beneficiaries    over    80,    over    85, 
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previously  hospitalized,  and  disabled  and  charges  for  admissions  of  beneficiaries 
also  eligible  for  Medicaid  are  very  similar  to  charges  for  admissions  of  other 
Medicare  l>eneficiaries  to  the  same  hospital  and  in  the  same  DRG  (Table  ES.7). 
Ttie  greatest  discrepancy  is  that  charges  for  patients  admitted  from  nursin'g  homes 
are  about  7  percent  higher  than  charges  for  other  t>eneficiaries.  Despite  the 
generally  small  differences  found,  a  number  of  these  patient  characteristics  might 
l>e  incorporated  into  the  DRG  system  to  improve  specific  DRGs.  The  findings  are 
restricted  to  differences  between  patients  in  the  same  hospital  and  do  not  provide 
information  on  the  relative  costs  of  hospitals  which  have  a  disproportionate  share 
of  any  of  these  beneficiary  groups. 

E.  EVEN  WITHOUT  A  MORE  REFINED  CASE  MIX  ADJUSTMENT,  HOSPITALS 
GENERALLY  DID  WELL  IN  THE  FIRST  YEAR  OF  PPS.  Although  the  findings 
sumnarized  to  this  point  raise  the  possibility  that  unmeasured  severity  and 
intensity  and  low  outlier  payments  may  compromise  the  PPS  case  mix  adjustment 
system,  empirical  data  are  necessary  to  determine  whether  the  various  other 
adjustments  in  PPS  compensate  for  these  problems. 

HCFA  examined  Medicare  cost  reports  for  hospitals'  first  year  under  PPS  and 
the  impact  of  recent  payment  changes  on  the  payment  which  would  be  made  under 
fully  national  rates  starting  in  1988.  Some  of  the  cost  reports  were  unaudited,  but 
analysis  suggested  that  the  overall  picture  was  similar  to  that  which  could  be 
expected  from  the  final  audited  reports. 

1.  Operating  Margins.  In  the  first  year  of  PPS,  Medicare  payments  for 
operating  expenses  of  inpatient  care  exceeded  operating  costs  for  that  care  by  16.4 
percent.  Under  national  rates,  we  project  that  Medicare  payments  for  inpatient 
care  will  exceed  hospital  costs  by  12.4  percent.  Because  smaller  hospitals  tended 
to  have  lower  margins  than  larger  hospitals,  the  average  hospital  had  a  Medicare 
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operating  margin  of  12.01  percent  in  its  first  PPS  year  and  has  a  projected 
operating  margin  of  12.14  percent  under  national  rates.  The  American  Hospital 
Association  reports  sharply  lower  overall  hospital  operating  margins^  but  the 
differences  appear  to  reflect  different  accounting  procedures  and  the  fact  that 
HCFA  data  is  limited  to  Medicare  patients'.  Hospitals  generally  realize  greater 
margins  on  Medicare  beneficiaries  than  on  other  patients. 

Projections  for  future  years  are  t>ased  on  an  assumption  that  the  only  change 
in  cost  per  case  wQl  t>e  that  resulting  from  inflation  in  the  hospital  market  basket. 
However,  hospitals  can  and  do  change  their  costs  dramatically  in  response  to  PPS 
incentives.  In  the  first  year  of  PPS,  the  average  hospital  reduced  its  costs  16.4 
percent  compared  to  the  costs  which  HCFA  had  anticipated  (because  HCFA 
overestimated  costs,  the  actual  economies  were  somewhat  less).  Under  fully 
national  rates,  and  assuming  that  operating  costs  increase  exactly  at  the  rate  of 
inflation,  we  project  that  23.3  percent  of  hospitals  will  have  an  operating  loss, 
while  26.0  percent  will  have  an  operating  profit  greater  than  30  percent. 

The  profitability  of  Medicare  business  resulted  from  four  factors  which  may 
not  be  repeated: 

—  The  hospital  markett>asket  inflated  less  than  expected. 

—  Changes    in    diagnosis    and    procedure    coding    increased    DRG-based 
payments  more  than  had  been  expected. 

—  Pressure  of  time  forced  HCFA  to  set  payments  by  using  unaudited  cost 
reports  which  overstated  base  year  costs. 

—  Hospitals  were  able  to  achieve  significant  efficiencies  under  the  new 
PPS  incentives. 

To  lM*ing  Medicare  payments  more  in  line  with  costs,  the  Department  has  proposed 
PPS  update  factors  to  correct  for  overestimate  of  inflation,  changes  in  coding,  and 
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elimination  of  inefficient  practices.   Congress  has  enacted  higher  updates  than  the 
Department  has  proposed. 

2.  OveraU  Correlation  Between  Costs  and  Payments.  In  the  first  year  of  PPS, 
payments  were  closely  correlated  with  costs.  Average  payment  per  case  at  the 
hospital  level  reflected  89.0  percent  of  the  variation  in  hospital  average  cost  per 
case.  Under  fully  national  rates,  the  correlation  is  less  close,  but  we  project  that 
payments  per  case  at  the  hospital  level  will  still  reflect  71.4  percent  of  variation  in 
hospital  average  cost  per  case. 

3.  The  Range  of  Variation  in  Operating  Margin  Among  Hospitals.  Among 
hospitals  there  are  wide  variations  in  operating  margins.  In  the  first  year  of  PPS, 
19.4  percent  of  hospitals  had  an  operating  loss  while  9.5  percent  had  an  operating 
profit  greater  than  30  percent.  Under  fully  national  rates,  we  project  that  23.3 
percent  of  hospitals  will  have  an  operating  loss,  while  26.0  percent  will  have  an 
operating  profit  greater  than  30  percent.  In  the  following  analysis  we  discuss  the 
variation  in  operating  margin  across  classes  of  hospitals,  but  the  reader  should 
notice  that  Tables  ES.l  and  ES.2  show  that  the  relation  of  hospital  characteristics 
to  costs  (and  hence  to  margins)  depends  heavily  on  which  combination  of 
characteristics  which  we  consider. 

4.  Urt>an/Rural.  In  the  first  year  of  PPS,  Medicare  losses  were  concentrated  in 
rural  hospitals,  but  the  Omnibus  Budget  Reconciliation  Act  of  1986  made  changes 
in  the  payment  formula  which  eliminated  this  difference.  Under  fully  national 
rates,  we  project  that  23.2  percent  of  rural  hospitals  (compared  to  an  overall  rate 
of  23.3  percent)  will  have  an  operating  loss,  and  26.2  percent  of  rural  hospitals 
(compared  to  an  overall  rate  of  26.0  percent)  are  projected  to  have  an  operating 
profit  greater  than  30  percent. 

5.  Teaching/Nonteaching.  In  the  first  year  of  PPS,  teaching  hospitals  did 
substantially  t>etter  than  nonteaching.  While  22.4  percent  of  nonteaching  hospitals 
had  losses  and  7.9  percent  had  profits  at>ove  30  percent,  only  5.4  percent  of  light 
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teaching  hospitals  (less  than  0.25  residents  per  bed)  and  4.10  percent  of  heavy 
teaching  hospitals  has  Josses,  while  lU.U  percent  of  light  teaching  hospitals  and  32.0 
percent  of  heavy  teaching  hospitals  had  profits  above  30  percent.  Recent 
legislation  has  reduced  the  teaching  adjustment,  adjusted  the  outlier  system, 
changed  the  urban/rural  differential,  and  created  a  disproportionate  share 
adjustment,  resulting  in  a  net  reduction  in  the  advantages  of  teaching  hospitals. 
We  project  that,  under  fully  national  rates,  23. 'f  percent  of  nonteaching  hospitals, 
23. S  percent  of  light  teaching  hospitals,  and  18.7  percent  of  heavy  teaching 
hospitals  will  have  operating  losses;  26.7  percent  of  nonteaching  hospitals,  20. k 
percent  of  light  teaching  hospitals,  and  33.7  percent  of  heavy  teaching  hospitals 
will  have  operating  profits  above  30  percent. 

6.  Hospital  Size.  Hospital  size  is  directly  related  to  profitability,  but  the 
relationship  reverses  between  the  first  year  of  PPS  and  projected  experience  under 
national  rates.  In  the  first  year,  for  example,  35.1  percent  of  hospitals  with  fewer 
than  50  beds  had  an  operating  loss,  while  8.6  percent  had  an  operating  profit  over 
30  percent;  no  hospital  over  599  beds  had  a  loss  in  the  first  year  while  26.7  percent 
had  an  operating  profit  over  30  percent.  By  contrast,  we  project  that  under 
national  rates  21.1  percent  of  hospitals  with  fewer  than  50  beds  would  have  an 
operating  loss  while  39.3  percent  would  have  an  operating  profit  above  30  percent; 
IS.l  percent  of  hospitals  over  599  beds  would  have  a  loss  while  21.6  percent  would 
have  more  than  30  percent  profit.  The  reversal  appears  to  reflect  the  lower 
historical  costs  of  small  hospitals  and  their  inability  to  cut  costs  as  much  as  larger 
hospitals.  In  the  first  year  of  PPS,  high  operating  margins  resulted  mainly  from 
hospitals  doing  well  compared  to  their  own  historical  costs;  at  full  implementation, 
high  operating  margins  will  reflect  how  hospitals  do  compared  to  other  hospitals. 

7.  Variation   Across  Regions.     Operating  margins  vary  across  regions  and  will 
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vary  more  under  fully  national  rates,  even  after  adjusting  for  differences  in 
hospital  size,  teaching  activity,  case  mix,  and  wages.  Table  ES.  10  shows  that  in 
the  first  PPS  year  the  mid-Atlantic  States,  Illinois,  and  Michigan  had  operating 
margins  atx>ut  4-5  percentage  points  higher  than  New  England,  Kentucky, 
Tennessee,  Alabama,  Alaska,  and  Hawaii,  with  margins  in  other  States  falling 
between.  We  project  that  under  fully  national  rates  hospitals  in  Iowa,  Kansas, 
Mississippi,  Missouri,  Nebraska,  North  Dakota,  South  Dakota,  and  Utah  will  have 
margins  about  10  percentage  points  higher  than  comparable  hospitals  in  Arizona, 
Florida,  Illinois,  Michigan,  Nevada,  and  New  Mexico.  Because  these  differences 
reflect  differences  in  margins  for  hospitals  with  similar  characteristics  while 
average  hospitals  have  different  characteristics  in  different  States,  the  table  does 
not  portray  differences  between  the  average  hospital  in  different  States.  The 
difference  in  patterns  from  the  first  PPS  year  to  national  rates  reflects  the  same 
effects  noted  in  discussing  large  and  small  hospitals:  hospitals  which  were 
historically  expensive  do  well  when  they  compete  against  their  record  but  much 
less  well  than  they  compete  against  national  standards. 

F.  PPS  DATA  INTEGRITY  IS  GOOD.  The  data  on  which  the  PPS  case  mix 
adjustment  system  operates  appear  to  have  enough  integrity  for  the  equitable 
operation  of  the  program  (Chapters  2  and  3).  The  substantial  change  in  coding 
which  occurred  at  the  time  of  implementation  of  PPS  is  consistent  with  improved 
data  quality,  (Chapter  2.IV.F).  DRG  weights  may  be  too  low  for  complex  cases  and 
too  high  for  simple  cases,  but  it  is  not  clear  that  this  "weight  compression"  has 
actually  resulted  in  inaccurate  payments  (Chapter  4. IV).  Although  the  data  used  to 
calculate  the  original  DRG  weights  included  significant  amounts  of  unnecessary 
care,  that  data  weakness  does 

not  appear  to  have  resulted  in  inaccurate  DRG  weights  (Chapter  4.IV). 


2« 


G.  RESEARCH  EFFORT.  Before  Congress  enacted  PPS,  HCFA  spent  more  than 
$9.4  million  on  case  mix  research;  HCFA  has  spent  $S.l  million  more  since  then. 
Although  this  report  cJocuments  impressive  prog  ess  on  a  number  of  fronts,  there  is 
DO  one  "solution*^  to  the  problem  and  the  need  for  a  number  of  partial  solutions  is 
Increasingly  clear.  HCFA  has  already  invested  heavily  in  a  number  of  the  most 
promising  options. 

H.  QUALITY  OF  CARE  AND  CASE  MIX  ADJUSTMENT.  Case  mix  adjustment  has 
l)oth  a  direct  and  an  indirect  application  to  quality  of  care.  Directly,  more 
accurate  case  mix  adjustment  will  permit  paying  hospitals  the  amounts  necessary 
to  deliver  quality  care  and  to  assure  beneficiaries  access  to  quality  care. 
Indirectly,  and  perhaps  equally  important,  the  same  measures  of  case  mix  which 
permit  more  accurate  payment  to  hospitals  also  p>ermit  more  accurate  comparisons 
of  health  care  outcomes  across  hospitals  by  allowing  investigators  to  assure  that 
comparisons  between  hospitals  are  made  for  patients  who  are  truly  similar. 
I.  CASE  MIX  ADJUSTMENT  AND  CAPITATION  RATES.  Research  on  case  mix 
adjustment  promises  to  make  several  contributions  to  setting  more  accurate 
payments  for  capitated  health  care  systems  such  as  HMOs.  While  case  mix 
refinement  based  on  acute  severity  of  Qlness  has  limited  application  to  capitated 
rates,  study  of  the  way  chronic  illness  affects  hospital  costs  offers  direct 
application  to  setting  capitation  rates. 

III.  CONCLUSIONS 
The  findings  lead  to  three  significant  policy  conclusions: 

A.  THE  MEANING  OF  DIFFERENCES  BETWEEN  COSTS  AND  PAYMENTS. 
Because  we  do  not  have  satisfactory  measures  of  severity  of  illness  or  satisfactory 
ways  to  assess  what  intensity  of  care  is  medically  appropriate,  we  cannot  resolve 
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the  questions  of  which  differences  between  costs  and  payments  are  appropriate. 
Clearly,  perfect  correlation  between  costs  and  payments  would  be  equivalent  to 
cost-based  reimbursement;  because  there  still  appears  to  be  substantial  room  for 
Increasing  efficiency  in  the  hospital  industry,  payments  which  perfectly 'reflect 
costs  would  not  t>e  a  desirable  goal.  On  the  other  hand,  we  cannot  reach  a 
technical  judgment  whether  the  degree  of  match  between  costs  and  payments 
which  we  have  found,  and  which  we  project  to  occur  under  national  rates,  is  the 
correct  one.  The  differences  between  costs  and  payments  may  reflect  correctable 
inefficiencies  for  which  Medicare  should  not  pay;  if  this  is  true,  then  the  high 
operating  margins  realized  by  some  hospitals  may  be  useful  incentives  to  other 
hospitals  to  become  more  efficient.  On  the  other  hand,  the  differences  l)etween 
costs  and  payments  may  mean  that  hospitals  with  severely  ill  patients  and 
medically  appropriate  intense  care  are  being  penalized  by  a  system  which  does  not 
fully  measure  their  special  responsibilities.  Current  technology  does  not  allow  us 
to  determine  which  of  these  two  scenarios  is  most  correct. 

During  the  first  years  of  PPS,  most  hospitals  were  protected  from  possible 
payment  imprecision  under  national  rates  not  only  by  the  blend  of  regional  and 
hospital-specific  rates  but  also  by  high  payments  relative  to  historical  costs  and 
substantial  gains  in  efficiency.  In  the  immediate  future,  a  growing  numt>er  of 
hospitals  may  t>e  pressed  by  an  increasingly  competitive  health  care  environment. 
Hospitals  will  have  fewer  alternative  sources  of  income  to  which  they  can  turn  if 
Medicare's  payments  are  too  low.  On  the  other  hand,  Medicare  currently  provides 
average  margins  in  excess  of  12  percent,  which  leaves  substantial  protection 
against  any  error. 

Under  these  circumstances,  continuing  close  monitoring  of  the  impact  of  PPS 
on  providers  and  patients  l>ecomes  especially  important,  and   close   cooperation 
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between  Congress  and  the  Administration  in  charting  the  future  course  of  the 
program  wDl  be  essential.  The  refinements  of  case  mix  measurement  which  could 
Improve  payment  for  severity  and  complexity  are  also  the  research  tools  necessary 
to  determine  whether  PPS  payments  are  equitable;  further  refinement  of  case  mix 
Adjustment  methods  is  an  integral  part  of  the  future  of  PPS. 

B.  IMPROVING  CASE  MIX  ADJUSTMENT  THROUGH  OUTLIER  POLICY.  One 
way  to  improve  accuracy  of  payment  at  the  individual  case  level  is  to  pay  more 
precisely  for  outlier  cases.  Current  law,  which  requires  paying  the  full  marginal 
costs  of  care  of  outliers,  raises  technical  problems  which  cannot  t>e  fully  resolved 
and  provides  no  incentives  for  efficiency  once  a  case  passes  the  outlier  threshold. 
HCFA  is  actively  pursuing  research  on  refining  outlier  payments. 

C.  IMPROVING  CASE  MIX  ADJUSTMENT  THROUGH  DRG  REFINEMENT. 
Although  major  immediate  improvements  in  the  DRG  system  are  not  possible, 
modest  improvements  have  already  t)een  made  and  others  appear  feasible.  The 
latter  include  using  Disease  Staging  to  refine  DRGs,  more  sophisticated  use  of  age 
in  DRGs,  and  better  use  of  comorbidities  and  complications.  In  the  longer  term, 
use  of  physiologic  measures  and  further  adjustment  for  intensity  of  care  appear  to 
be  productive  strategies. 

IV.  FURTHER  PLANS 
These  findings  and  conclusions  lead  to  the  following  program: 

A.  CONTINUE  THE  PRESENT  DRG  UPDATE  SYSTEM.  HCFA  will  continue  its 
present  system  for  DRG  updates,  which  has  led  to  appropriate  progressive  changes 
In  the  system. 

B.  HCFA  will  continue  to  study: 
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—  Approaches  for  refinements  to  DRGs  t>ased  on  more  precise  use  of 
comorbidity  and  complication  data.  Tliese  refinements  will  develop  from 
work  at  Systemetrics  using  Disease  Staging,  work  at  Yale  using  a  variety 
of  other  systems,  or  some  combination  of  the  two. 

—  Approaches  for  changes  in  teaching  adjustments  to  take  account  of  outlier 
payments. 

C.     SUBSEQUENT  REFINEMENTS; 

—  Evaluate  the  application  of  severity  of  illness  measures  to  payment  for 
outlier  cases. 

—  Analyze  pilot  studies  of  severity  measures  to  determine  whether  and  at 
what  cost  they  provide  a  general  impw*ovement  in  case  mix  adjustment. 

V.  ORGANIZATION  OF  THE  REPORT 
This  report  is  designed  to  develop  a  framework  for  analyzing  outliers,  severity,  and 
intensity;  to  present  a  substantial  body  of  new  research,  much  of  it  carried  out  for 
this  report;   and  to   document   the   detaDed  evidence  and   reasoning   t)ehind   the 
recommendations  contained  in  this  Executive  Summary.  The  report  provides: 

—  a  review  of  the  goals  and  organization  of  PPS  (Chapter  1); 

—  a  discussion  of  the  reasons  for  variation  in  need  for  care  among  individual 
patients  and  the  problems  of  measuring  that  need  (Chapter  2); 

—  a  summary  of  all  available  research  on  alternatives  to  and  improvements 
to  DRGs  (Chapter  3); 

—  an  examination  of  the  reasons  for  variation  in  average  need  for  care 
among  the  patients  of  different  hospitals  and  of  the  impact  of  PPS  and 
outlier  payments  on  hospitals  (Chapter  4);  and 

—  an  agenda  for  improving  case  mix  measurement  in  the  next  5  years 
(Chapter  5). 
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Table  ES.l 


PREDICTORS  OF  HOSPITPJL  AVERAGE  CDST  PER  MEDICARE  CASE 
Percentage  Increase  for  Every  1.0  Percent  Increase  in  Variable 


1984  Costs 


1981  Costs 

A:  Basic  IVbdel 

Case  Mix  Index 

1.014 

Teaching* 

0.579 

Wage  Index 

1.019 

Large  SMSA 

11.60 

Medium  SMSA 

2.76 

Small  SMSA 

0.10 

Number  of  Beds 

0.119 

0.954 

0.583 

0.959 

9.91  ) percent 

3.78  ) higher 

2.54  )than  rural 

0.086 


R^ 


0.725 


0.702 


B:  Without  Hospital  Size  (beds) 


Case  Mix  Index 

1.464 

Teaching* 

0.845 

Wage  Index 

0.980 

Large  SMSA 

23.18 

Medium  SMSA 

11 .  96 

Small  SiyiSA 

9.07 

R^ 


0.692 


1.295 

0.766 

0.863 

19.97  ) percent 
11.34  ) higher 

9.35  )than  rural 

0.682 


C:  Coefficients  Reflecting  Actual  1984  Payment  Procedures 


Case  Mix  Index 
Teaching* 
Wage  Index 
Urban 


1.000-" 
1.158^. 
0.7915- 
25.34 


0.666 
Value  mandated  by  law  or  regulation. 


1.000-" 
1.158^ 
0.7915^ 
18.70  percent  higher 
than  rural 
0.669 


*  Teaching  is  defined  as  the  ratio  of  interns  and  residents  to  hospital 
beds. 

1981  (N=4971):  hospitals  with  >49  Medicare  discharges  in  1981  and  any  1984 
discharges. 

1984  (N=5349):  hospitals  with  >49  IVfedicare  discharges  and  with  available 
Cost  Reports. 


(continued) 


Table  ES.l 
(continued) 


PE?EDICrORS  OF  HOSPITM.  AVER?^GE  COST  PER  CASE 
Percentage  increase  for  Every  1.0  Percent  Increase  in  Variable 


1981  Costs 


D:  With  Hospital  Bed  Size  Groups 


Case  Mix  Index 

1.097 

Teaching* 

0.742 

Wage  Index 

1.001 

Large  SMSA 

13.21 

Medium  SMSA. 

3.91 

Small  SMSA 

1.04 

51-100  Beds 

14.11 

101-250  Beds 

25.24 

251-500  Beds 

29.55 

More  than  500  beds 

32.81 

1984  Costs 


1.003 
0.702 
0.943 
10.57  )) 
4.42  )) 
87  )) 


percent 
higher 
than  rural 


0.725 


2 

9.10)  percent 

18.27)  higher  than 

21.07)  1-50  bed 

21.15)  hospitals 

0.702 


E: 


Case  Mix  Index 


R^ 


2.851 
0.401 


2.298 
0.423 


F: 


Case  Mix  Index 

1.469 

Teaching* 

0.899 

Wage  Index 

1.436 

1.376 
0.846 
1.191 


R" 


0.670 


0.669 


*  Teaching  is  defined  as  the  ratio  of  interns  and  residents  to  hospital 
beds. 

1981  (N=497l):  hospitals  with  >49  Medicare  discharges  in  1981  and  any  1984 
discharges. 

1984  (N=5349):  hospitals  with  >49  Medicare  discharges  and  with  available 
Cost  Reports. 


Source:  Medicare  Cost  Reports. 


Table  ES.2 


PREDICTORS  OF  HOSPIT?^  AVERAGE  COST  PER  MEDICARE  CASE  IN  1984 


Variable 


Effect^ 


Cumulative^  R^ 


Wage  Index 
Case  Mix  Index 
Teaching  Activity 
Urban  location 
Disproportionate  share-^ 


Wage  Index 
Case  Mix  Index 
Teaching  Activity 
Urban  location 
Disproportionate  share~ 
Outlier  payments 


0.971  0.487 

1.275  0.641 

0.686  0.661 

11.068  0.672 

0.719  0.674 

Including  Outlier  Payments 

0.891  0.487 

1.253  0.641 

0.336  0.661 

6.716  0.672 

0.133  0.674 

2.571  0.737 

Controlling  for  Hospital  Size 


Wage  Index 

1.031 

0.482 

Case  Mix  Index 

0.951 

0.641 

Teaching  ?ctivity 

0.618 

0.661 

Urban  location 

3.821 

0.672 

Disproportionate  share-^ 

0.497 

0.674 

50-99  beds 

9.771          1 

100-149  beds 

17.626           ] 

150-199  beds 

18.338          1 

1    0.696 

200-399  beds 

22.308 

more  than  399  beds 

25.028 

Effects  correspond  to  coefficients  in  a  log-linear  regression  model 
which  includes  all  of  the  variables.  For  continuous  variables  (wage 
index,  case  mix  index,  teaching  activity,  disproportionate  share,  and 
outlier  payments)  the  effect  is  the  percent  increase  in  cost  corresponding 
to  a  1.0  percent  increase  in  the  variable.  For  dumrry  variables  (urban 
location  and  bed  size)  the  effect  is  the  percent  increase  in  cost  per  case 
if  hospital  is  urban  or  if  it  has  the  indicated  number  of  beds,  caipared 
to  a  rural  hospital  or  to  one  with  1-49  beds. 

9.9.  .  ... 

Cumulative  R  is  the  portion  of  the  total  variance  m  hospital 
average  cost  per  case  which  is  accounted  for  in  a  stepwise  regression  by 
variables  down  to  the  given  value.  All  hospital  size  dummies  are  entered 
together. 

?     .       .  .     .  .       . 

The  disproportionate  share  variable  is  the  percentage  increase  m 

payments  to  the  hospital  which  is  defined  by  legislation;  the  coefficient 

indicates  the  percent  increase  in  cost  which  corresponds  to  a  1.0  percent 

increase  in  payments  required  by  law. 

Outlier  payments  are  the  fraction  of  payments  for  the  hospital  made  to 
the  hospital  estimated  from  casemix  during  the  first  year  of  PPS  and 
payment  rules  for  Federal  Fiscal  Year  1986  but  assuming  full 
irrplementation  of  PPS. 


Source:  Medicare  Cost  Reports.  N=5350  hospitals. 


Table  ES.3 


Characteristics  of  Medicare  Outlier  Cases 
Cases  Paid  Under  PPS  in  Federal  Fiscal  Year  1984 


Tvpe 

Of  Outlier 

Non- 

7U1 

Length  of 

PPS 

Variable 

Stay 

Cost 

All 

Outliers 

Cases 

PPS  Discharges 

1.76% 

1.21% 

2.96% 

97.04% 

100.00% 

Urban  Hospitals 

2.07 

1.45 

3.52 

96.48 

100.00 

Rural  Hospitals 

0.98 

0.60 

1.58 

98.42 

100.00 

Teaching  Hospitals 

2.58 

1.24 

3.82 

96.18 

100.00 

Nonteaching  Hospitals 

1.34 

1.19 

2.53 

97.47 

100.00 

Hospital  Days 

9.17 

3.34 

12.51 

87.49 

100.00 

Total  Charges 

10.49 

7.17 

17.66 

82.36 

100.00 

Est.  Operating  Costs 

10.38 

6.80 

17.19 

82.81 

100.00 

Medicare  Payments 

3.35 

3.00 

6.35 

93.65 

100.00 

Charges  less  Payments 

34.71% 

21.32 

56.02 

43.98 

100.00 

Special  care  charges 
(percent  of  total) 

Ancillary  charges 

(percent  of  total) 


10.10  14.20  11.76  6.36 


62.12  63.89  62.83  55.68 


7.32 


56.48 


Length  of  Stay 
Total  Charge 
Charge  per  Day 
Est.  Operating  Cost 
Medicare  payment 
Est.  Operating  IMbrgin 


40.9  21.7  33.1 

$  26,151  $26,012  $26,094 

$    639  $1,200  $788 

$  17,116  $16,385  $16,819 

$  6,457  $8,399  $7,248 

$-10,658  $-7,989  $-9,573 


7.1  7.8 

$  3,714  $  4,377 

$  526  $   559 

$2,473  $2,898 

$3,262  $3,380 

$+791  $+484 


Source:  Stuart  Guterman,  Health  Care  Financing  Administration, 
unpublished  data. 


Table  ES.4 
BASIC  E^EATURES  OF  ALTERISIAnVE  C2\SE  MIX  MEASUREJyiEOT?  SYSTEMS 


SYSTEl^ 
NAME 

CREATOR 

ORGAN- 
IZATIONf 

ORDINAL/     NUMBER  OF      DATA 
NCN-        GROUPS/SCAT  F,  REQUIRE-    PRO- 
ORDINAL*       POINTS      MEWTS    PRIEIARY 

Diagnosis- 
Related 
Groups 
(DRGs) 

Thonpson 
Fetter 
Aver ill 

Yale 

Nonordinal 

468 

UHDDS 

No 

Acnjte       Knaus 
Physiology   Wagner 
and  CJironic 
Health  Evaluation 

(APAOiE) 

George 

Washington 

University 

Ordinal 

71 

specially 
abstracted 
physiologic 
data 

No 

Disease 
Staging 
(Staging) 

Gonella 

Thomas 

Jefferson 

University 

Nonordinal** 
4 

420       UHDDS 
groups 
levels  each) 

Software 
yes; 

architect 
ure  no 

Medical 
Illness 
Severity 
Grouping 
System 
(MHIDISGRPS) 

Brewster  MediQnal 
Inc. 

Ordinal 

5 

specially 
abstracted 
history  and 
physiologic 
data 

Software 
and  archi 
tecture 
secret . 

Patient 
Management 
Categories 
{mcs) 

Young 

Blue  Cross  Nonordinal*** 
of  Western 
Pennsylvania 

720 

UHDDS 

No 

Severity 
of  Illness 
Index 
(SOU) 

Horn 

Johns 

Hopkins 

University 

Ordinal 

5 

specially 
ccxied  scales 
"iirplicitly" 

integrated 

No,  but 

training 

manuals 

not 

public 

Computer- 
ized 

Severity 
Index 
(CSI) 

Horn 

Johns      Nonordinal** 

Hopkins  & 

Health 

Systems 

International 

5 

specially 
abstracted 
physiologic 
data 

Incom- 
plete 

*In  an  ordinal  system,  patients  have  greater  severity  or  care  needs  than  patients 

with  lower  numbers  and  less  than  patients  with  higher  numbers. 

**  Stages  are  ordinal  within  each  staged  disease  category.  CSI  levels  are  ordinal 

within  a  diagnosis. 

***  Hycs  are  ordinal  within  major  disease  category. 


Table  ES.5 

a3yiF?^Risas[  of  the  vARXAnoNr  (r-squ?\ere)  nxr  cost  of  iNDivrDUAL  cases 

EXPLAINED  BY  THE?EE  CASE  MIX  MEASURES 


Diagnosis- 
Related 
Groups 


Patient 
Management 
Categories 


Disease 
Staging 


Triiraned 


0.30 


0.26 


0.17 


Untrimmed 


0.17 


0.15 


0.10 


*  Cost  data  are  untrimmed  and  exclude  DRGs  468  and  470. 
Source:  Calore,  unpublished  data  1985. 


Table  ES.6 

DEATHS  DURING  T^nyCSSIONF  AND  WITHIN  30  DAYS  AFTER  DISCHARGE 
5  Percent  Sairple  of  1984  Medicare  Discharges 


Total 
Acams 
Oregon  5995 
Utah  1897 
li^oining  725 
Washington  8292 
MDntana  1802 
Iowa  7573 
Kansas  6466 
Minnesota  9503 
Nevada  1691 
Arkansas  7385 
S  Dakota  2214 
Wisconsin  11278 
Arizona  5946 
Oklahoma  7779 
N  Mexico  2246 
Colorado  4692 
Nebraska  4491 
Georgia  13633 
Missouri  15294 
Kentucky  9256 
Texas  33071 
N  Dakota  2260 
California  44630 
Louisianna  9814 
Idaho  1807 
Tennessee  14757 
Indiana  12871 
Vermont  784 
Alablam  10603 
Mississippi  7741 
Ohio  21572 
Delaware  1201 
Florida  34209 
Hawaii  1156 
N  Carolina  12150 
Illinois  27084 
S  Carolina  6282 
Penna.  34868 
Michigan  19938 
New  Hairp.  1363 
Virginia  10678 
Maine      3076 


.ths  Within 

Deaths  During 

30  Days 

T^dmission 

8.4% 

4.9% 

8.8% 

5.6% 

7.9% 

4.8% 

8.0% 

5.3% 

8.2% 

5.7% 

7.4% 

5.0% 

7.5% 

5.1% 

7.6% 

5.2% 

8.6% 

6.3% 

8.3% 

5.9% 

6.0% 

3.7% 

7.4% 

5.2% 

7.7% 

5.4% 

8.5% 

6.3% 

6.9% 

4.7% 

7.1% 

4.9% 

7.7% 

5.6% 

7.4% 

5.4% 

7.6% 

5.5% 

7.5% 

5.5% 

7.5% 

5.5% 

6.3% 

4.4% 

7.6% 

5.7% 

7.4% 

5.5% 

6.8% 

4.9% 

7.0% 

5.2% 

7.6% 

5.8% 

7.9% 

6.1% 

6.6% 

4.8% 

6.8% 

5.1% 

7.6% 

5.9% 

8.7% 

7.0% 

7.3% 

5.6% 

8.0% 

6.3% 

8.2% 

6.6% 

7.5% 

5.9% 

7.2% 

5.8% 

7.3% 

5.9% 

7.3% 

5.9% 

6.9% 

5.5% 

7.5% 

6.1% 

7.3% 

6.0% 

Average  Length 
Difference   of  Stay 


•J  •  Z)  o 

3.2% 

3.0% 

2.7% 

2.6% 

2.4% 

2.4% 

2.4% 

2.4% 

2.4% 

2.2% 

2.2% 

2.2% 

2.2% 

2.2% 

2.2% 

2.1% 

2.0% 

2.0% 

2.0% 

2.0% 

1.9%' 

1.9% 

1.9% 

1.8% 

1.8% 

1.8% 

1.8% 

1.8% 

1.8% 

1.8% 

1.7% 


7% 
6% 
6% 
6% 


1.5% 
1.4% 
1.4% 
1.4% 
1.3% 
1.3% 


6.51 
6.64 
7.52 
6.86 
7.00 
8.12 
8.67 
7.69 
7.81 
7.30 
7.45 
8.35 
7.92 
7.75 
7.32 
7.84 
7.94 
7.60 
8.94 
8.46 
8.14 
7.79 
7.71 
7.85 
6.50 
8.65 
8.89 
8.93 
8.15 
7.76 
9.32 

10.41 
8.42 
8.74 
8.83 
9.72 
8.84 

10.03 
9.31 
8.61 
9.28 
8.44 


[continued  on  next  page] 


Table  ES.6 
(Continued) 

DEATHS  DURIMG  ?J3y[ISSIC8Nr  AND  WITHIN  30  DAYS  A^TER  DISCHARGE 
5  Percent  Sairple  of  1984  Medicare  Discharges 


'iUIAL 

DEATHS  WITHIN 

DEATHS  DURING 

AVERAGE,  U 

Aiayis 

30  DAYS 

AOVIISSICgNI 

Dll^'l'EREL^CE 

OF  STA^ 

W.  Virginia  5769 

6.7% 

5.4% 

1.2% 

8.17 

Rhode  Is.   2193 

7.7% 

6.6% 

1.1% 

10.33 

Connecticut  6392 

8.1% 

7.1% 

1.0% 

9.84 

Maryland    6107 

7.4% 

6.7% 

0.8% 

10.33 

Dist.  Col.   1293 

6.7% 

6.0% 

0.7% 

12.20 

Mass.      14560 

7.2% 

6.7% 

0.5% 

11.58 

Alaska       226 

4.0% 

3.5% 

0.4% 

7.96 

New  Jersey  15837 

7.7% 

7.6% 

0.1% 

12.11 

New  York   37973 

7.8% 

8.6% 

-0.8% 

13.82 

U.S.      530423  7.5%  5.9%  1.6%       9.11 


Source:  Eggers  P.  Unpublished  Data. 


Table  ES.7 


RELATIVE  CHARGES  FOR  BEIXIEFICI7\RIES  WITH  EIGHT  CHT^RACTERISTICS 

Conpared  to  Otiierwise  Similar  Beneficiaries 

Controlling  for  DRG  and  Hospital  in  which  Care  was  Rendered 

5  Percent  Sanple  of  1983  IVfedicare  Discharges 


Beneficiary- 
Characteristic 

1.  Nursing  Home 

Resident. 

2.  Hospitalization 

in  Previous  Year. 


Relative 
Charge 

1.062*** 


1.033*** 


3.  Medicaid 

Enrollee 

4.  Aged  079). 


0.985** 


1.038*** 


5.  Very  aged  (>84) . 


0.987* 


6 .  Younger 

disabled  <65) 

7.  Younger  endstage 

renal  disease. 

8.  Older  disabled. 

064) 


0.993 


1.052** 


1.067*** 


*  =  p  <.05  that  ratio  is  =  1.00;  **  =  p  <.01  that  ratio  is 
***  =  p  <.001  that  ratio  is  =  1.00,  using  t  test. 

Based  on  10  random  sanples  of  60  DRGs  each. 

Source:  Jencl^  &  Kay  1986. 


1 .  00 ; 


Table  ES.8 
Medicare  Operating  Margin  By  Urban/Rural  Location  and  Number  of  Beds 
Actual  First  Year  of  PPS  and  Projected  Experience  under  Full  National  Rates 


Percent 

Of  Hospitals 

Number 

Loss 

Profit 

of 

>20% 

20-lC 

|9- 

10-5% 

5-0% 

0-5% 

5- 

-10% 

10 

-20%  20-30% 

>30% 

Hosps 

Urban 

PPS  yr  1 

4 

4 

5 

6 

11 

11 

28 

19 

13 

545 

<100 

beds 

National 
rates 

6 

5 

5 

6 

8 

7 

14 

10 

40 

675 

Urban 

PPS  yr  1 

1 

1 

1 

4 

8 

13 

35 

25 

12 

1347 

100- 

404 

National 

3 

6 

5 

9 

10 

10 

20 

14 

21 

1649 

beds 

rates 

Urban 

PPS  yr  1 

0 

0 

0 

3 

5 

13 

32 

29 

18 

277 

405- 

684 

National 

0 

8 

5 

8 

11 

11 

21 

17 

20 

332 

beds 

rates 

Urban 

PPS  yr  1 

0 

0 

0 

0 

1 

3 

15 

16 

13 

48 

>684 

beds 

National 
rates 

0 

10 

7 

3 

11 

11 

21 

17 

20 

70 

Rural 

PPS  yr  1 

6 

8 

7 

11 

12 

12 

26 

12 

7 

1795 

<100 

beds 

National 
rates 

4 

7 

5 

6 

8 

9 

18 

14 

30 

2029 

Rural 

PPS  yr  1 

1 

4 

7 

10 

17 

17 

31 

11 

3 

357 

100- 

169 

National 

4 

10 

6 

11 

11 

8 

20 

15 

13 

397 

beds 

Rural 

PPS  yr  1 

1 

3 

4 

10 

16 

22 

31 

10 

3 

175 

>169 

beds 

National 

4 

7 

5 

7 

11 

10 

26 

14 

17 

197 

All 

PPS  yr  1 

3.0 

4. 

4 

4.5 

7.6 

10. 

8 

12. 

6 

29.9 

17.7 

9.5 

4544 

hosps 

National 

3.7 

6. 

9 

5.2 

7.5 

9. 

0 

9. 

4 

18.6 

13.8 

26.0 

5349 

Table  ES.9 
Medicare  Operating  Margins  By  Urban/Rural  Location  and  Teaching  Status 
Actual  First  Year  of  PPS  and  Projected  Experience  under  Ftill  National  Rates 


Pe 

rcent  oJ 

f  Hosp 

itals 

- 

Number 

Loss 

Profit 

of 

>20% 

20-10% 

10- 

-5% 

5-0%  0- 

-5%  5- 

10%  10- 

-20%  20 

-30% 

>30% 

Hosps 

Urban 

PPS  yr  1 

2 

2 

3 

5  _ 

10 

13 

32 

22 

11 

1491 

non- 

teach- 

National 

4 

6 

6 

8 

10 

9 

18 

13 

27 

1823 

ing 

Urban 

PPS  yr  1 

0 

1 

1 

3 

6 

12 

27 

37 

15 

608 

light 
teach- 

National 

2 

7 

6 

9 

9 

12 

20 

13 

21 

744 

ing 

Urban 

PPS  yr  1 

0 

1 

0 

3 

4 

3 

27 

30 

31 

118 

heavy 
teach- 

National 

2 

6 

5 

6 

8 

7 

21 

12 

33 

159 

ing 

Rural 

PPS  yr  1 

5 

7 

7 

11 

13 

13 

27 

12 

6 

2271 

non- 

teach- 

National 

4 

7 

5 

7 

9 

9 

19 

14 

26 

2556 

ing 

Rural 

PPS  yr  1 

2 

4 

2 

2 

17 

15 

37 

13 

8 

52 

light 
teach- 

National 

3 

10 

8 

2 

8 

5 

27 

20 

17 

60 

ing 

Rural 

PPS  yr  1 

0 

0 

0 

0 

0 

25 

0 

25 

50 

4 

heavy 
teach- 

National 

0 

14 

0 

0 

0 

14 

0 

14 

57 

7 

ing 

All 

PPS  yr  1 

3.0 

4.4 

4 

.5 

7.6 

10.8 

12.6 

29.9 

17.7 

9.5 

4544 

hosps 

National 

3.7 

6. 

9 

5.2 

7.5 

9.0 

9.4 

18.6 

13.8  26.0 

5349 

Table  ES.IO 

Variations  in  Hospital  Medicare  Operating  Margins  TV^ross  Regions 
Controlling  for  other  hospital  characteristics 


Variable 


PPS  Year  1- 


National  Rates ^ 


IN, 

cm, 

WI 

MS 

KY, 

TN, 

PL 

NB, 

ND, 

SD 

10, 

KA, 

MD, 

TK 

AR, 

lA, 

OK 

NM,  NV 


New  England 

Mid  Atlantic 

South  Atlantic  ex  FL 

South  Atlantic:  FL 

East  N  Cent:  ZL  &  MI 

East  N  Cent: 

East  S  Cent: 

East  S  Cent: 

West  N  Cent: 

West  N  Cent: 

West  S  Cent: 

West  S  Cent: 

Mountain:  UT 

Mountain:  M 

Mountain:  ID,  MT,  CO,  WY 

Pacific:  CA 

Pacific:  AK,  HI 

Pacific:  OR,  WA 

Wage  Index 
Case  Mix  Index 
Teaching  Intensity 
Urban 

9 

Disproportionate  share 

Less  than  50  beds 

50-99  Beds 

100-149  beds 

150-199  beds 

200-399  beds 

More  than  399  beds 


MN 


** 


0.00, 
+5.43 
+1.02 
-0.10 
-4.89 

3.63 

3.43 
-0.16 

2.38 

1.47 

1.08 

1.09 

2.42 
-1.81 

3.52 

1.77 
-2.81 

3.14^ 

-0.084 
0.382 
0.097' 
6.097, 
0.173 
0.00 
2.498 
1.113 
0.476 
0.501 
0.728 


**** 


■k* 


**:lc* 


0.00 
+7.71 
+9.55' 
-0.59 
-1.41 
6.56' 
14.48 
10.80 
16.36 
9.07 
7.71 
11.11' 
14.53 
-1.03 
10.62 
3.93' 
7.37 
16.35 


**** 
**** 


**** 
**** 


**** 


**** 


-0.147 
-0.011 
0.182' 
9.359' 
0.479 
0.00 
-5.887 
-10.841 
-9.134' 
-12.012 
-13.669 


*** 


**** 


Mjusted  R^ 


0.163 


0.091 


*=  p  < . 05  :  **  =  p<.01;  ***  =  p<.001;  ****  =  p<.0001 

For  notes  see  Table  ES.2.  Coefficients  for  regions  are  relative  increase  or 
decrease  in  operating  margin  ccaipared  to  a  otherwise  similar  hospital  in  the 
New  England  region. 


Source:  Medicare  Cost  Reports. 


CHAPTER  1 
THE  GOALS  AND  METHODS  OF  THE  PPS  CASE  MIX  ADJUSTMENT  SYSTEM 
Case  mix  adjustment  is  the  procedure  by  which  Medicare  adjusts  per  case 
payments  to  an  individual  hospital  to  reflect  the  characteristics  of  cases  treated  in 
that  hospital.  Case  mix  adjustments  may  also  be  made  to  compare  hospital 
mortality  rates  and  other  measures  of  hospital  performance,  but  this  report 
discusses  such  potentially  important  uses  only  briefly. 

There  is  no  "gold  standard"  against  which  to  judge  the  adequacy  of  a  case  mix 
adjustment  system,  and  neither  legislation  nor  regulation  specifies  the  exact 
purposes  of  case  mix  adjustment  under  PPS.  To  decide  whether  case  mix 
adjustment  is  doing  what  it  should,  we  must,  therefore  examine  many  different 
kinds  of  evidence  and  fit  tiiem  together  into  a  complete  picture.  This  chapter 
reviews  case  mix  adjustment  under  PPS,  and  identifies  the  goals  the  system  should 
achieve  and  specific  criteria  against  which  it  should  be  judged. 

Case  mix  measurement  and  the  payment  system  are  inseparable.  DRGs  can  be 
evaluated  and  refined  only  as  part  of  a  systein  which  includes  outlier  payments  and 
hospital-level  payment  adjustments.  The  DRG  classification  system  was  never 
designed  or  expected  to  reflect  payments  for  outliers  or  to  take  account  of  factors 
such  as  wages,  which  influence  local  costs.  The  PPS  outlier  payment  was  designed 
to  deal  with  inherent  limitations  of  DRGs,  and  hospital-level  adjustments  were 
designed  to  complement  the  DRG  and  outlier  "systems".  This  interdependence 
dictates  the  structure  of  this  chapter.  Section  I  describes  the  goals  of  case  mix 
adjustment,  Section  II  examines  the  interdependent  system  which  seeks  to  meet 
these  goals,  and  Section  III  discusses  the  significance  of  differences  between  a 
hospital's  costs  and  what  Medicare  pays  and  describes  ways  in  which  different  parts 
of  the  payment  system  can  interact  to  produce  complex  payment  effects. 
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I.  THE  GOALS  OF  CASE  MIX  ADJUSTMENT 
PPS  created  a  revolution  by  paying  hospitals  for  the  care  that  patients  were 
expected  to  need  on  average  rather  than  the  care  that  the  hospital  actually 
provided.  This  involved  two  separate  reforms—paying  for  the  case  rather  than  for 
individual  services  (paying  for  larger  aggregations  of  services  is  often  called 
bundling)  and  defining  the  payment  in  a  schedule  published  before  care  is  rendered 
(prospective  payment). 

A.  GOALS  OF  PPS.    PPS  has  three  primary  goals: 

1.  Promote  Efficiency.  Give  hospitals  incentives  to  provide  quality  care 
as  efficiently  as  possible  by  eliminating  per-service  retrospective  payments,  which 
encouraged  long  stays  and  profligate  use  of  services. 

2.  Control  Costs.  Make  Medicare's  payments  for  hospital  care 
predictable  and  controllable  through  prospectively  determined  payments  for  a  unit 
of  service  (the  admission)  whose  volume  and  composition  has  historically  been 
relatively  predictable. 

3.  Be  Equitable.  Make  resource  allocation  equitable  to  hospitals  and 
patients  by  basing  allocation  on  actual  patient  need  rather  than  on  possibly 
inappropriate  historical  patterns. 

There  is  no  serious  question  that  PPS  has  succeeded  in  its  first  two  goals. 
However,  this  success  does  not  necessarily  imply  equity;  nor  does  initial  success 
necessarily  imply  long-term  viability.  During  the  phase-in  of  PPS,  hospitals  have 
been  protected  against  sharp  changes  in  Medicare  payments  by  a  hospital-specific 
component  and  a  regional  component  in  their  payment  rates.  In  addition,  a 
teaching  adjustment  and  an  urban/rural  adjustment  were  included  to  further  reduce 
sudden  changes  in  payment.  Hospitals  were  further  protected  by  three  "one-time" 
events  which  resulted  in  payments  which  were  generally  higher  than  costs: 
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the    initial   costs   on    which    payments    were    based    were    overestimated 
because  audited  cost  reports  were  unavailable, 
improved  coding  accuracy  significantly  increased  payments,  and 
hospitals  were  able  to  realize  substantial  operating  efficiencies. 
If  as  overpayments  are  corrected  and  hospital-specific  and  regional  payments  are 
phased  out,  hospitals  will  have  less  protection  against  possible  imprecision  in  the 
system.    At  the  same  time,  hospitals  will  face  private  sector  competition,  problems 
with  uncompensated  care,  and  a  Federal  budget  constrained  by  "Gramm-Rudman- 
Hollings." 

In  the  short  run,  PPS  creates  strong  incentives  for  efficiency  independent  of 
its  equity.  A  hospital  has  powerful  reasons  to  shorten  inpatient  stays,  to  provide 
some  services  for  the  inpatient  stay  prior  to  admission  or  after  discharge,  and  to 
provide  a  more  efficient  mix  of  services  because  it  knows  that  each  dollar  spent  on 
care  reduces  its  profits.  PPS  encourages  efficiency  even  if  it  does  not  pay 
hospitals  exactly  what  they  sometimes  need  to  treat  their  patients,  but  chronic 
underpayment  could  decrease  access  to  care  and  quality  of  care. 

B.  CRITICISMS  OF  CASE  MIX  ADJUSTMENT.  When  Congress  asked  for  a 
report  on  severity  of  illness,  intensity  of  care,  and  outlier  payments,  the  clear 
concern  was  whether  the  PPS  case  mix  adjustment  system  would  pay  fairly  for  the 
costs  of  sophisticated,  complex,  and  critical  care.  Hospitals,  physicians,  and  others 
have  charged  that  the  current  case  mix  measurement  system  fails  in  three 
important  ways  to  measure  patients'  needs  for  care: 

1.  Severity.  Critics  charge  that  the  system  fails  fully  to  measure  severity  of 
illness  and  need  for  costly  specialized  and  tertiary  care,  resulting  in  systematic 
underpayments  to  hospitals  specializing  in  such  care.    Specifically,  they  argue  that 
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the  higher  costs  of  teaching  hospitals  and  the  lower  costs  of  small  hospitals  reflect 
differences  in  average  severity  of  patient  illness  which  should  be  measured  by  the 
case  mix  adjustment  system. 

2.  Special  Populations  and  Diseases.  Critics  charge  that  special  costs  of 
caring  for  the  frail,  the  very  old,  the  disabled,  patients  with  special  diseases,  and 
the  poor  are  not  fully  measured,  giving  hospitals  incentives  to  discriminate  against 
such  patients.  As  specific  evidence,  critics  cite  findings  that  the  costs  of  hospitals 
with  a  disproportionate  share  of  poor  patients  are  higher  than  the  case  mix  system 
predicts.  These  higher  costs  led  Congress  to  legislate  in  1986  adjustments  for 
disproportionate  share  hospitals  similar  to  those  enacted  for  teaching  and 
urban/rural  differences  (Consolidated  Budget  Reconciliation  Act  of  1985  (COBRA), 
P.L  99-272). 

3.  Random  and  Systematic  Risk  from  Severely  111  or  "Outlier"  Patients. 
Critics  charge  that  Medicare  makes  the  same  basic  payment  for  patients  who  vary 
so  much  that  hospitals,  especially  small  hospitals,  are  exposed  to  excessive  risk 
from  random  variation  and  have  incentives  to  avoid  some  patients  or  discharge 
them  prematurely.  As  evidence,  critics  cite  individual  cases  in  which  small 
hospitals  have  incurred  very  high  unreimbursed  costs  for  Medicare  beneficiaries 
which  have  little  prospect  of  being  balanced  out  by  payments  for  treating  other 
Medicare  beneficiaries.  Some  critics  also  maintain  that  outliers  are  distributed 
unevenly  across  hospitals,  creating  systematic  problems  for  those  hospitals. 
Critics  have  argued  that  the  outlier  problem  could  be  relieved  by  a  severity 
measure. 

Other  observers  have  made  complaints  about  PPS  which  reflect  on  their 
expectations. 

^'  Perceived  Equity.  The  system  is  neither  well  understood  nor  perceived  to 
be  equitable  by  all  who  participate.    This  encourages  hospitals  which  receive 


payments  lower  than  their  costs  to  feel,  not  that  they  are  penalized  for 
inefficiency,  but  that  they  are  cheated.  Misunderstandings  have  also  led  some 
hospital  managers  and  physicians  to  perceive  outlier  thresholds  and  even  mean 
lengths  of  stay  as  Medicare  limits  on  how  long  patients  are  allowed  to  stay  in 
hospital. 

5.  Limitations  for  Management.  The  case  mix  adjustment  system  does  not 
provide  the  information  which  hospital  managers  need  to  manage  the  care  of 
Medicare  patients.  In  particular,  critics  point  out  that  the  Medicare  system  forces 
hospital  managers  to  examine  costs  in  their  hospitals,  but  that  DRGs  are  not 
sufficiently  precise  for  small  groups  of  patients  to  allow  a  manager  to  determine 
whether  individual  physicians  and  departments  are  efficient. 

6.  Not  Helpful  for  Capitation.  Finally,  critics  most  concerned  with  moving 
Medicare  to  capitation  payment  charge  that  case  mix  adjustment  has  been 
excessively  focused  on  characteristics  of  individual  discharges  to  the  exclusion  of 
the  patient  characteristics  which  must  be  the  basis  of  adjusting  capitation  rates. 

C.  GOALS  FOR  CASE  MIX  ADJUSTMENT.  The  above  criticisms,  whether 
accurate  or  not,  imply  a  reasonable  set  of  study  areas  for  case  mix  adjustment.  A 
case  mix  adjustment  system  should: 

1.  Be  Equitable  to  Hospitals.  Provide  to  each  hospital  total  payment  which  is 
adequate  to  meet  the  efficient  treatment  needs  of  its  Medicare  beneficiaries.  The 
system  should  progressively  clarify  whether  urban/rural,  teaching,  regional, 
disproportionate  share,  and  other  adjustments  rest  on  case  mix  differences  or  on 
other  differences  among  hospitals.  Equity  to  hospitals  should  include  appropriate 
protection  against  the  cost  of  outlier  cases. 

2.  Be  Equitable  to  Beneficiaries.    Pay  for  classes  of  beneficiaries  and  kinds  of 

diseases  an  amount  sufficiently  related  to  the  costs  of  medically  necessary  care  so 

that  no  hospital  has  an  incentive  to  discriminate  against  beneficiaries  with  certain 
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diseases  or  certain  social  or  economic  characteristics  or  to  provide  inadequate 
care. 

3.  Be  Fully  Understood  by  Hospitals,  Physicians,  and  Beneficiaries.  Those  who 
depend  on  the  case  mix  adjustment  system  should  be  able  to  understand  it,  and 
especially  be  able  to  understand  the  concepts  of  averaging  and  outlier  payment  on 
which  it  depends.  The  limitations  of  the  system  for  hospital  management  and 
capitation  should  be  explicit. 

D.  THE  IMPORTANCE  OF  EQUITY.  If  reimbursement  is  not  equitable, 
consequences  could  be  serious  both  for  beneficiaries  and  for  the  health  care 
system.  Medicare  beneficiaries,  or  certain  groups  of  beneficiaries,  might  find  that 
the  health  care  system  could  not  afford  to  treat  them,  and  beneficiaries  might  be 
denied  care  or  dumped  into  hospitals  of  last  resort.  If  payments  are  inequitable, 
hospitals  which  continued  to  treat  all  patients  could  face  severe  financial 
problems,  not  because  of  their  own  inefficiency,  but  because  PPS  did  not  reimburse 
them  accurately.  In  the  long  run,  the  health  care  system  might  become  less  rather 
than  more  efficient. 

Some  commentators  argue  that  there  is  so  much  flexibility  and  "fat"  in  the 
health  care  system  that  even  fairly  serious  payment  inequities  will  simply  be 
absorbed  in  the  short  run.  The  high  average  profits  reported  for  the  first  year  of 
PPS  (Chapter  ^.V)  support  this  argument,  but  not  in  all  hospitals. 

The  designers  of  PPS  also  understood  from  the  outset  that  the  transition  to 
PPS  would  force  many  high-cost  hospitals  to  alter  their  practices  dramatically. 
Conversely,  no  one  knew  how  hospitals  whose  payments  exceeded  their  costs  would 
use  their  newfound  resources.  The  designers  of  PPS  used  cushioning  mechanisms  to 
minimize    the    shock    of    transition   by    fitting    the   new    system    to   historic   cost 
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patterns.  In  any  change,  there  must  be  winners  and  losers,  but  PPS  uses  averages 
in  a  way  which  makes  it  difficult  to  determine  whether  a  hospital  loses  because  of 
inefficiency  or  because  its  patients  have  needs  which  are  recognized  by  the  clinical 
staffs  but  not  by  PPS.  Few  hospitals  which  lose  are  inclined  to  attribute  their 
losses  to  their  own  inefficiency  and  few  hospitals  which  gain  attribute  their  gains 
to  having  a  patient  population  with  unusually  simple  needs. 

Accordingly,  both  Congress  and  HCFA  require  a  system  which  gives  them 
justified  confidence  that  hospitals  win  and  lose  under  PPS  primarily  because  of 
their  efficiency  or  inefficiency  rather  than  because  of  errors  in  the  case  mix 
system.  Equally,  unless  hospitals,  physicians,  and  beneficiaries  believe  that  PPS  is 
equitable,  providers  will  feel  justified  in  manipulating  the  system  and  beneficiaries 
may  ultimately  be  denied  needed  care.  This  report  specifically  addresses  these 
issues  by  exploring  evidence  as  to  which  hospitals  experienced  substantial  gains  or 
losses  in  the  first  year  of  PPS  (Chapter  ^.V)  and  whether  those  gains  or  losses  can 
be  attributed  to  needs  of  patients  which  PPS  does  not  currently  recognize.  To 
understand  this  exploration,  however,  the  reader  will  need  to  appreciate  some 
details  of  PPS,  whose  interdependent  adjustments  are  quite  complex. 

II.  THE  STRUCTURE  OF  THE  CURRENT  SYSTEM 
This   section    describes   the    intricate    system    which    the   architects   of    PPS 
developed  to  assure  that  per  case  payment  to  hospitals  would  be  equitable.     In 
Chapter  'f.I,  we  will  examine  some  of  the  policy  judgments  which  were  made  in 
constructing  this  system. 
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A.  MULTIPLE  ADJUSTMENTS.  PPS  pays  hospitals  for  care  to  Medicare 
beneficiaries  at  a  per-case  rate,  which  is  set  before  care  occurs  and  reflects 
expected  rather  than  actual  costs.  The  per-case  rate  is  carefully  adjusted  to 
reflect  the  differences  in  cases  treated  among  hospitals.  The  DRG  system  has 
received  so  much  publicity  that  many  readers  assume  that  it  is  the  only  way  in 
which  PPS  adjusts  per  case  payments  to  reflect  differences  in  case  mix  between 
hospitals.  Actually,  PPS  uses  a  set  of  interdependent  adjustments,  of  which  DRGs 
are  not  always  the  most  important. 

PPS  first  determines  a  standard  payment  amount  per  case,  which  is  the 
amount  that  would  be  paid  for  the  average  patient  in  the  average  hospital  in  the 
same  kind  of  region  (rural  or  urban).  Currently,  this  standard  payment  is  a  blend  of 
the  hospital's  historical  cost  per  case,  the  regional  average  cost  per  case,  and  the 
national  average  cost  per  case.  Under  original  legislation,  all  PPS  hospitals  were 
to  be  paid  at  a  national  rate  starting  in  fiscal  year  1987,  but  recent  legislation  has 
temporarily  deferred  this  transition  for  most  hospitals  until  1988  (Table  1.1). 

PPS  makes  three  important  kinds  of  adjustments  to  the  standard  payment  in 
order  to  make  per-case  payment  equitable:  DRG-based  measurement  of  case  mix, 
outlier  adjustments,  and  hospital-level  factors.  But  these  adjustments  are  made 
separately  and  in  a  somewhat  more  complex  sequence:  selection  of  a  rural  or  urban 
basic  payment  amount,  adjustment  of  the  payment  amount  by  the  local  wage  index, 
multiplication  of  the  wage-adjusted  payment  amount  by  the  DRG  weight, 
application  of  any  outlier  adjustment,  multiplication  by  any  indirect  teaching 
adjustment,  and,  finally,  addition  of  direct  teaching  and  capital  "pass-throughs." 

Finally,  some  facilities  are  so  different  from  the  typical  PPS  facility  that 
either  Congress  has  excluded  them  from  PPS  by  law  or  the  Secretary  has  exempted 
them  by  regulation.    Excluded  facilities  include  hospitals  with  an  average  stay  over 
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25  days  and  pediatric  hospitals,  hospitals  and  distinct  units  which  primarily  provide 
psychiatric  or  rehabilitation  services,  and  hospitals  located  in  States  which  are 
exempt  from  PPS.  In  addition,  certain  special  facilities  are  exempted  by 
regulation,  including  certain  alcohol,  and  comprehensive  cancer  hospitals  and  units 
(Langenbrunner  1986).  These  facilities  are  paid  under  Tax  Equity  and  Fiscal 
Responsibility  Act  of  1982  regulations  which  will  not  be  discussed  in  this  report. 

B.  URBAN-RURAL  DIFFERENCES.  PPS  classifies  all  hospitals  as  urban  (located 
in  a  Metropolitan  Statistical  Area  (MSA)  or  equivalent  area  or  rural  (all  others)  and 
calculates  standard  payment  amounts  separately  for  the  two  groups.  However, 
there  are  two  further  special  arrangements.  First,  the  Department  has  defined  in 
regulation  certain  features  of  size  and  range  of  services  under  which  about  160 
hospitals  are  designated  as  rural  referral  centers  and  paid  like  urban  hospitals. 
Second,  about  360  more  hospitals  have  been  defined  as  sole  community  hospitals 
because  no  other  hospitals  are  reasonably  accessible  to  their  patients;  they  will 
receive  indefinitely  a  blended  rate  which  is  75  percent  hospital  specific  and  25 
percent  a  regional  rural  rate.  The  Department  will  send  Congress  a  separate 
report  on  urban/rural  differences. 

C.  ADJUSTMENT  FOR  COST  OF  LABOR.  The  next  step  is  to  adjust  the  standard 
payment  amount  to  reflect  the  cost  of  labor.  Originally  this  local  index  was  based 
directly  on  information  from  the  Bureau  of  Labor  Statistics  but  more  recently  the 
Department  has  developed  a  more  sophisticated  measurement  which  includes 
information  from  direct  surveys  of  hospitals  and  controls  for  local  variation  in  the 
number  of  part-time  employees. 

D.  DRGs  AND  THE  CASE-MIX  INDEX.  Once  the  hospital's  standard  payment 
amount  is  determined  and  adjusted  to  reflect  wage  rates,  the  per-case  payment 
rate  is  further  adjusted  to  reflect  the  patient's  illness  and  treatment  by  the  DRG 
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system.  All  discharges  are  assigned  to  one  of  the  DRGs  (see  Chapter  2JV  for 
details).  To  determine  the  payment  for  a  specific  case,  PPS  gives  a  DRG-specific 
weight  (published  annually  in  the  Federal  Register)  to  each  case,  and  the  weight  is 
multiplied  by  the  standard  payment  amount  for  that  hospital.  The  average  DRG 
weight  of  a  hospital's  Medicare  discharges  is  the  hospital's  case  mix  index  (CMI). 
Thus,  while  payments  are  actually  made  on  a  case-by-case  basis,  the  average 
payment  across  all  cases  is  equal  to  the  hospital's  standard  payment  amount 
multiplied  by  the  CMI  and  further  adjusted  as  described  in  Parts  E  and  F  of  this 
section. 

DRGs  address  severity  in  several  ways—by  the  use  of  specific  disease  codes, 
by  differentiation  among  procedures,  by  the  use  of  age,  and  by  taking  certain 
secondary  diagnoses  as  indicators  of  "comorbidity  and  complications."  But  DRGs 
do  not  use  any  clinical  data  or  laboratory  findings  to  measure  need  for  care  and  the 
adequacy  of  the  way  DRGs  measure  severity  of  illness  remains  unproven.  A 
number  of  systems  have  been  proposed  as  possible  refinements  to  or  replacements 
for  DRGs  (Chapter  3). 

E.  OUTLIER  CASES.  After  adjusting  for  DRG,  PPS  makes  special  additional 
payments  for  "outliers"— patients  with  unusually  costly  or  unusually  lengthy  stays. 
The  Social  Security  Amendments  of  1983  required  that  5-6  percent  of  payments 
under  PPS  be  made  through  a  system  designed  to  account  for  outlier  cases.  Outlier 
payments  work  as  follows:  ■ 

a.  Day  Outliers.  For  each  DRG,  PPS  sets  a  threshold  length  of  stay  for 
outliers;  in  1984  the  threshold  was  defined  as  the  lesser  of  20  days  or  1.94 
geometric   standard  deviations  above   the  geometric    mean  l'=»n'Tth  of  stay.     All 
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necessary  inpatient  days  above  the  stay  threshold  are  reimbursed  at  a  flat  per  diem 
rate  which  is  60  percent  of  the  PPS  payment  per  day  for  patients  assigned  to  that 
DRG  (defined  as  the  PPS  payment  divided  by  the  geometric  mean  length  of  stay). 

b.  Cost  Outliers.  In  addition,  the  outlier  policy  makes  allowance  for  patients 
who  have  very  high  bills  but  do  not  exceed  the  length  of  stay  threshold.  For  these 
"cost  outliers,"  PPS  pays  about  ^3  percent  of  charges  exceeding  a  threshold  (which 
was  $12,000  or  130  percent  of  the  DRG  rate  in  1983-198^);  ^3  percent  results  from 
multiplying  the  60  percent  marginal  cost  figure  by  0.72,  which  is  the  average  ratio 
of  operating  costs  to  charges  in  Medicare  bills.  The  thresholds  were  calculated  so 
as  to  have  15  percent  of  cases  cost  outliers  and  85  percent  day  outliers.  Chapter 
^.VI  examines  the  outlier  problem  in  detail. 

F.  INDIRECT  TEACHING  COSTS.  The  teaching  adjustment  is  applied  to  the  total 
calculated  payment,  including  outlier  payments.  The  original  increase  was  11.6 
percent  for  each  tenth  of  an  intern  or  resident  per  hospital  bed,  but  the  COBRA 
reduced  this  adjustment  and  incorporated  it  into  a  more  complex  payment  formula. 
Table  1-2  compares  payments  under  the  old  and  new  formulas.  This  payment  was 
proposed  as  support  for  indirect  costs  of  medical  education  and  is  not  explicitly 
designated  as  related  to  the  case  mix  of  teaching  hospitals. 

G.  REGIONAL  DIFFERENTIALS.  Regional  adjustments  were  designed  to  cushion 
hospitals  during  the  transition  period  (1983-1987)  to  fully  national  rates;  they  are 
scheduled  to  be  phased  out. 

H.  DIRECT  TEACHING  AND  CAPITAL  PASS-THROUGH.  PPS  places  the  direct 
cost  of  approved  medical  education  activities  and  the  costs  of  capital  outside  of 
the  prospective,  case-based  payment  system.  These  costs,  subject  to  certain 
limitations,  are  "pass-throughs,"  and  Medicare's  share  of  these  costs  is  paid  directly 
to  the  hospitals.   This  report  will  not  examine  these  issues. 
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I.  THE  INTERACTION  OF  ADJUSTMENT  FACTORS.  The  preceding  discussion 
makes  clear  that  the  Medicare  case  mix  measurement  system  is  interdependent 
with  PPS  payment  rules  in  ways  which  make  evaluation  difficult.  PPS  determines 
payments  not  only  by  DRGs  but  also  by  the  outlier  payment  system  and  hospital- 
level  adjustments.  For  example,  if  urban/rural  or  teaching  adjustments  are 
inaccurate,  separating  the  results  from  inaccuracies  in  the  case  mix  and  outlier 
systems  will  be  very  difficult.  Likewise,  evaluations  of  the  DRG  classification 
system  will  substantially  underestimate  the  applied  power  of  the  DRG  system 
within  PPS  if  cases  paid  as  outliers  are  included  in  the  evaluation  but  the  outlier 
payments  are  omitted.  A  comparable  evaluation  error  will  occur  if  the  outlier 
system  pays  less  than  intended  and  the  underpayment  is  mistaken  for  a  defect  in 
DRGs.  Finally,  assessing  the  outlier  system  depends  on  a  consistent  interpretation 
of  whether  it  is  intended  to  complement  and  extend  the  DRG  system  by  paying  full 
costs  or  is  intended  to  create  strong  incentives  for  shortening  stays  by  paying  less 
than  costs. 

There  are  no  studies  in  the  literature  that  take  full  account  of  all  these 
adjustments  in  evaluating  the  adequacy  of  DRGs  or  the  equity  of  PPS  (Cretin  and 
Worthman  1986).  Even  Cretin  et  al.'s  (1986)  study,  which  was  the  first  comparison 
of  actual  payments  with  costs  under  PPS,  had  to  create  "synthetic"  costs  because 
198^  Medicare  cost  reports  were  not  yet  available  for  analysis.  We  believe  that 
the  current  report  is  the  first  national  comparison  of  actual  PPS  payments  with 
actual  PPS  costs. 

III.  EQUITY  AND  PAYMENT  DISCREPANCIES 
A  payment  discrepancy  is  the  difference  between  the  payment  which  Medicare 
makes   and    the    actual   cost   of   care.      Individual    payment    discrepancies   do    not 
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necessarily  produce  unfairness— for  example,  an  overpayment  on  one  case  may  be 
offset  by  an  underpayment  on  the  next  case,  producing  an  equitable  average 
payment  to  the  hospital.  Thus,  the  implications  of  individual  payment 
discrepancies  in  a  payment  system  built  on  averages  can  be  subtle  and  very 
difficult  for  a  hospital  manager  to  assess. 

PPS  legislated  a  profound  change  in  hospital  payment  which  is  not  yet 
universally  understood  by  either  PPS  critics  or  hospital  managers.  Medicare  no 
longer  pays  the  cost  of  a  specific  case,  or  the  cost  of  comparable  cases  treated  in 
an  average  hospital,  or  even  the  cost  of  treatment  determined  to  be  necessary  for 
an  individual  case.  Instead,  PPS  seeks  to  pay  hospitals  fairly  for  the  total  care  of 
their  Medicare  patients. 

Because  PPS  uses  hospital-level  factors  based  on  averages,  a  finite  number  of 
DRGs,  and  an  outlier  system  based  on  average  marginal  costs,  it  cannot  pay  an 
exactly  correct  amount  for  each  case.  The  underlying  assumption  in  PPS  is  that 
the  treatment  needs  of  individual  patients  in  a  DRG  do  vary  from  the  average  of 
needs  for  all  patients  in  that  DRG,  but  that  this  variation  is  randomly  distributed 
from  hospital  to  hospital.  Although  some  cases  will  require  fewer  resources  and 
others  will  require  more,  the  average  for  each  hospital  is  expected  to  approximate 
closely  the  average  across  all  hospitals. 

Because  we  cannot  determine  whether  case  mix  measurement  is  accurate  by 
looking  at  individual  cases,  we  must  distinguish  random  cost  variations  from 
systematic  cost  differences  between  hospitals  (McGuire  1983).  The  next  three 
sections  examine  random  discrepancies,  systematic  discrepancies,  and  patient- 
oriented  discrepancies. 

A.  RANDOM  DISCREPANCIES.  Random  discrepancies  result  from  random 
variation  among  cases  and  are  the  necessary  result  of  paying  average  costs.    With  a 
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sufficiently  large  number  of  admissions,  random  discrepancies  average  out  and  are 
unimportant  for  the  hospital's  budget  (this  protection  is  known  as  the  law  of  large 
numbers);  all  but  the  smallest  hospitals  have  enough  admissions  to  secure  this 
protection,  but  many  individual  departments,  physicians,  and  DRGs  do  not.  If  a 
ORG  has  25  cases  and  a  typical  amount  of  variation,  the  average  cost  will  differ 
from  the  national  average  cost  (and  therefore  from  payment)  by  at  least  10  percent 
more  than  half  the  time  and  by  at  least  20  percent  more  than  a  fifth  of  the  time. 

About  1  percent  of  hospitals  with  35  beds  and  a  75  percent  occupancy  rate 
would  experience  a  10  percent  shortfall  by  chance  alone.*  But,  since  such 
shortfalls  are  due  to  chance,  they  cannot  be  expected  to  persist  from  year  to  year, 
and  only  one  hospital  in  10,000  could  expect  such  an  experience  two  years  running. 
Random  payment  discrepancies  attributable  to  within-DRG  variation  are  very 
small  for  classes  of  hospitals  because  the  law  of  large  numbers  affords  protection 
proportional  to  the  total  number  of  admissions  in  the  entire  class  of  hospitals. 

Reducing  the  amount  of  variation  within  a  DRG  or  increasing  the  precision  of 
outlier  payments  protects  hospitals  against  random  risk  even  if  there  is  no 
systeiTiatic  risk.  Even  if  overall  payment  is  equitable,  random  payment 
discrepancies  can  create  significant  probleins  for  hospital  managers,  not  only 
because  they  can  create  cash  flow  problems  but  also  because  they  make  it  difficult 
for  the  manager  to  determine  whether  the  hospital  is  operating  inefficiently  or 
whether  it  is  experiencing  a  chance  run  of  difficult  patients.  This  is  especially  so 
because  management  can  rarely  be  conducted  on  a  hospital-wide  basis.  Rather, 
individual  departments  and  even  units  within  departments  must  be  managed,  and 


*If  the  average  coefficient  of  variation  for  each  DRG  is  0.82  (HCFA  unpublished 
data),  the  relative  standard  error  is  about  ^  percent  for  payments  to  hospitals  with 
1^00  Medicare  discharges  a  year.  A  hospital  with  35  beds,  30  precent  Medicare 
discharges,  and  a  75  percent  occupancy  rate  would  expect  this  protection.) 
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these  have  fewer  discharges  and  are,  therefore,  more  subject  to  chance 
fluctuations.  Thus,  although  Medicare  payment  may  be  equitable  if  the  average 
payment  to  hospitals  is  equitable,  hospital  managers  now  need  to  assess  the 
appropriateness  of  costs  for  smaller  units  in  order  to  manage  effectively.  This 
dilemma  leads  managers  to  complain  that  DRGs  are  too  imprecise  (Chapter  2.I.C 
explores  this  issue)  and  may  lead  them  to  adopt  stringent  cost-saving  strategies  in 
response  to  a  chance  run  of  expensive  cases.  Other  managers,  who  appear  not  to 
understand  the  concept  of  averages,  describe  every  patient  with  costs  above 
average  as  a  "loser"  without  examining  the  counterbalancing  profits  on  patients 
with  costs  below  average. 

B.  SYSTEMATIC  DISCREPANCIES.  Systematic  discrepancies  are  those  against 
which  the  law  of  large  numbers  affords  no  protection.  The  remainder  of  this  report 
focuses  on  possible  causes  of  systematic  discrepancies,  including  unmeasured 
variation  in  treatment  needs  among  patients  in  different  hospitals,  inaccurate 
hospital-level  adjustfnents,  inaccuracies  in  outlier  payment  policies,  and 
differences  in  efficiency  among  hospitals. 

To  the  extent  that  PPS  is  increasingly  able  to  explain  and  correct  systematic 
discrepancies  between  hospitals,  the  problems  posed  by  random  discrepancies  will 
diminish  because  hospital  managers  will  be  able  to  distinguish  random  and 
systematic  discrepancies  and  to  have  increasing  confidence  that  random  variations 
will  average  out  over  time.  A  reasonable  measure  of  success  will  be  the  ability  of 
PPS  to  produce  DRGs  or  other  classification  methods  which  have  increasingly 
homogeneous  costs  across  hospitals  regardless  of  variation  within  hospitals. 

C.  POTENTIAL  DISCRIMINATION.  The  simple  distinction  between  random  and 
systematic  discrepancies  ignores  the  incentive  for  a  hospital  to  identify 
discrepancies  which  have  been  randomly  distributed  in  the  past,  to  send 
unprofitable  cases  elsewhere  (dumping),  and  to  recruit  profitable  cases  (skimming). 
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Hospitals  can  do  this  if  they  can  predict  that  the  PPS  payment  will  differ  from  the 
cost  of  necessary  treatment  for  any  particular  patient  and  can  find  a  way  to  fend 
off  losers  and  recruit  winners.  Thus,  variation  or  heterogeneity  within  a  DRG 
creates  risk  of  a  discrimination  against  unprofitable  patients  if  and  only  if  a 
hospital  can,  or  believes  it  can,  identify  those  patients  who  will  be  "losers"  either 
in  advance  of  admission  or  soon  after  admission. 

A  troubling  implication  of  this  analysis  is  that  payment  to  the  hospital  need 
not  be  incorrect  in  order  to  create  discrimination.  If,  for  example,  hospital 
administrators  commonly  believe  that  admissions  from  nursing  homes  or  admissions 
of  impoverished  beneficiaries  are  "losers,"  those  beneficiaries  may  encounter 
discrimination  even  if  the  common  belief  is  in  error.  Thus,  determining  whether 
such  incentives  exist  is  important  so  that  the  incentives  can  be  corrected  if  they 
are  found  and  so  that  hospital  managers  can  be  educated  if  there  is  no  rational 
ground  for  discrimination  (see  Chapter  2.1 11. A  and  3. VII.). 

A  second  implication  is  that,  if  the  more  costly  patient  can  be  identified 
before  admission,  even  a  well-designed  and  equitable  payment  system  can  become 
the  basis  of  discrimination  for  a  manager  more  interested  in  profits  than  in 
providing  health  care.  Peer  Review  Organizations  (PROs)  have  a  special  role  in 
identifying  such  problems  if  they  begin  to  emerge  and  a  special  mandate  to  prevent 
the  "dumping"  of  patients  through  premature  or  inappropriate  discharge. 
D.  HOSPITAL  DIFFERENCES  AND  EQUITY.  A  critical  question  in  determining 
equity  to  beneficiaries  and  hospitals  is  whether,  after  case  mix  correction  and 
hospital-level  adjustments,  hospitals  either  do  or  potentially  can  receive,  select,  or 
recruit  patients  whose  treatment  costs  differ  systematically  from  the  average 
patient  in  the  same  DRG. 
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This  question  can  be  put  in  a  different  way.  If  we  examine  all  patients  in  a 
region  who  are  assigned  to  a  DRG,  is  the  variation  in  treatment  needs  within  that 
DRG  randomly  distributed  across  hospitals  or  's  it  systematically  associated  with 
factors  which  either  do  or  could  determine  the  hospital  which  a  patient  enters?  If 
variation  is  correlated  with  hospital  selection,  then  some  hospitals  may  get 
patients  with  greater  treatment  needs,  others  may  get  patients  with  less  treatment 
needs,  and  the  system  is  potentially  unfair  or  biased.  If  the  hospital-level 
adjustments  and  outlier  policy  do  not  correct  the  bias,  then  the  system  is  unfair  in 
practice. 

Teaching  hospitals,  public  hospitals,  and  specialty  hospitals  can  easily 
demonstrate  that  their  patients  have  distinctive  characteristics  (e.g.,  being 
referred,  being  poor,  having  special  kinds  of  illness).  The  issue  is  whether  these 
distinctive  patient  characteristics  are  correlated  with  the  variation  in  treatment 
requirements  within  a  DRG.  If  that  correlation  exists  and  if  hospital-level  and 
outlier  adjustments  do  not  correct  for  it,  then  payments  to  these  hospitals  will  not 
reflect  their  patients'  needs. 

PPS  was  constructed  with  a  built-in  conflict  between  an  effort  to  assure 
through  adjustments  that  groups  of  hospitals  did  not  win  or  lose  and  an  implicit 
assumption  that  some  hospitals  were  less  efficient  than  they  should  be.  The 
conflict  can  only  be  avoided  by  assuming  that  all  classes  of  hospital  are  equally 
efficient,  with  variations  in  efficiency  occurring  exclusively  within  classes  but 
never  between  classes.  Despite  this  paradox,  the  occurence  of  major  systematic 
winners  or  losers  under  the  first  year  of  PPS  would  raise  questions  about  the  equity 
of  the  system. 

E.  SUMMARY.    Section  I  described  the  goals  of  case  mix  adjustment  as  equity  to 
both  hospitals  and  beneficiaries  and  defined  equity  as  paying  the  cost  of  necessary 
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care.  Section  II  outlined  the  way  in  which  PPS  seeks  to  accomplish  those  goals. 
Section  III  underlined  the  difference  between  paying  fairly  for  each  case  and  paying 
fairly  on  average  and  emphasized  that  the  policy  of  paying  fairly  on  average  poses 
difficulties  for  both  hospital  managers  and  PPS  managers  who  seek  to  determine 
whether  the  system  is  indeed  equitable.  The  next  chapter  examines  the  problem  of 
predicting  necessary  care  for  individual  patients  and  examines  the  success  of  the 
DRG  system  in  meeting  this  goal. 
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Table  1.1 
Transition  from  Hospital-Specific  to  National  Rates 


A.  Hospital-specific  and  Federal  Portions 
as  percent  of  total  payment 

Hospital  Fiscal  Year  Begins 


10/1/83- 
9/30/84 

10/1/84- 
9/30/85 

10/1/85 
9/30/86 

10/1/86- 
9/30/87^ 

after 
10/1/87 

Hospital-specific 

75 

50 

50-45^ 

25 

0 

Federal 

25 

50 

50-55^ 

75 

100 

B.  Regional  and  National  Coiiponents  of  Federal  Portion 
as  percent  of  Federal  portion 

Discharges  Occurring 


10/1/83- 
9/30/84 

10/1/84- 
9/30/85 

10/1/85 
9/30/86 

10/1/86- 
9/30/87^ 

after 
10/1/87 

100 

75 

75 

50 

0 

0 

25 

25 

50 

100 

Regional 

National 

The  blend  remains  50  percent  hospital-specific  through  the  first  7  months 
of  this  reporting  period  and  then  becomes  45  percent  hospital-specific  for  the 
last  5  months  of  the   reporting  period. 

Note:  Hospitals  in  Oregon  move  to  national  rates  somewhat  more  rapidly  than 
this  table  shows. 

Source:  Federal  Register  1986;  51:16774. 


Table  1.2 

Indirect  Medical  Education  7\djustments  under  Old  and  New  Law 
Percent  Increase  in  Payment 


Residents 

per 

Discharcres  Occurrina 

100  beds 

Before  5/1/86 

5/1/86-9/30/88 

after  10/1/88 

none 

0 

0 

0 

10 

11,6 

8.1 

8.7 

25 

29.0 

18.9 

20.7 

50 

57.9 

35.7 

39.7 

Source:  Social  Security  Amendments  of  1983  (P.L.  98-21)  and  Consolidated 
Qraiibus  Budget  Reconciliation  Act  of  1985  (P.L.  99-272). 
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PREDICTING  NEEDED  CARE  FOR  INDIVIDUAL  PATIENTS 


Understanding  the  issues  of  severity  of  illness,  intensity  of  care,  and  outlier 
payment  policy  requires  understanding  the  basic  problem  of  predicting  need  for  care 
from  facts  about  the  patient  and  the  patient's  illness.  This  chapter  focuses  on  using 
a  set  of  patient  characteristics  (the  independent  variables)  to  predict  a  patient's 
treatment  needs  (the  dependent  variable).  Section  I  examines  the  assumptions  which 
Medicare  built  into  PPS  by  adopting  DRGs  and  by  the  way  PPS  pays  for  outliers, 
Section  II  explores  the  concept  of  necessary  care  and  looks  at  the  sources  of 
variation  for  which  Medicare  should  and  should  not  pay,  Section  III  reviews  the 
issues  in  choosing  variables  to  predict  necessary  care,  and  Section  IV  assesses  DRGs 
as  a  case  mix  measurement  system.  Chapter  3  evaluates  alternatives  to  DRGs  using 
the  framework  applied  to  DRGs  in  Section  IV,  and  Chapter  4  examines  the  problem 
of  predicting  the  average  care  needs  for  all  patients  in  a  hospital. 

I.  THE  PPS  MODEL  OF  DIFFERENCES  IN  TREATMENT  NEEDS 
In  Chapter  1  we  reviewed  the  structure  of  PPS.    We  now  examine  both  the  DRG 
system  and  the  outlier  payment  system  in  order  to  understand  the  assumptions  about 
necessary  care  for  individuals  which  are  built  into  PPS. 

A.  THE  MODEL  IMPLICIT  IN  DRGs.  The  DRG  system  operates  from  the  Uniform 
Hospital  Discharge  Data  Set  (UHDDS),  which  hospitals  collect  for  every  Medicare 
discharge  and  for  most  other  discharges.  UHDDS  contains  only  the  principal 
diagnosis,    four    additional    diagnoses,    three    procedures,    the    patient's    age    and 
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discharge  status  (alive,  dead,  against  medical  advice,  transferred  to  another 
hospital,  transferred  to  another  institution,  sent  home),  admission  and  discharge 
dates,  and  expected  source  of  payment.  Thus,  Medicare's  decision  to  adopt  DRGs 
meant  that  PPS  could  use  only  UHDDS  variables  as  indicators  of  need.  Adopting 
DRGs  also  meant  incorporating  into  PPS  certain  conventions  which  had  been 
accepted  by  the  team  which  created  the  DRGs. 

Death  would  be  used  to  classify  patients  only  if  it  resulted  in  lower 
resource  consumption;  Medicare  would  not  reward  hospitals  if  patients 
died. 

There  would  be  only  300-500  DRGs;  the  number  of  groups  would  be  small 
enough  to  assure  significant  numbers  of  patients  in  each  group  in  most 
hospitals.  This  prevented  creating  small  DRGs  for  unusual  cases  and 
meant  that  a  special  policy  would  be  necessary  to  pay  for  such  cases  if 
Medicare  decided  that  they  were  not  evenly  distributed  or  that  they 
presented  an  unacceptable  risk  to  hospitals.  This  decision  reflected  the 
origin  of  DRGs  as  a  hospital  management  studies  tool  which  needed  a 
meaningful  number  of  cases  in  each  category  within  a  hospital  (Cretin  and 
Worthman  1986). 

A  single  set  of  comorbidity  and  complication  indicators  would  be  used  for 
all  DRGs;  testing  separate  lists  for  many  different  conditions  would  have 
required  computer  resources  and  personnel  which  were  not  available.  This 
meant  that  the  system  would  not  seek  to  distinguish  between 
complications  and  comorbidity  and  might  be  criticized  for  rewarding  care 
resulting  in  complications. 

Surgery  would  be  used  to  make  the  first  division  of  groups  within  a  major 
diagnostic    category;    since    the   groups    were    intended    to   be    clinically 
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homogeneous,  patients  were  separated  according  to  the  most  important 

division    in    medicine.      This   meant   that   the   system    would   accept    the 

surgeon's  decision  to  operate  as  a  reasonable  basis  for  deciding  payment; 

clinical  decisions  with  more  modest  cost  implications  were  generally  not 

recognized  as  a  basis  for  payment. 

The  decision  to  limit  the  number  of  groups  and  the  decision  that  DRGs  would  not  be 

designed  to  pay  outliers  led  directly  to  creation  of  an  outlier  policy  to  safeguard 

payment  equity  for  hospitals  and  individuals. 

B.  OUTLIER  POLICY.  The  most  important  ways  in  which  outlier  policy 
incorporated  an  implicit  model  of  interpatient  differences  was  the  way  in  which 
marginal  costs  were  defined  and  the  decision  that  PPS  would  not  have  an  outlier 
policy  for  low-cost  cases. 

Marginal  costs  were  calculated  on  an  assumption  that  the  marginal  cost 
for  outlier  cases  was  significantly  lower  than  the  average  per  diem  cost 
for  nonoutliers.  Information  to  test  this  hypothesis  was  not  available 
(Chapter  4  provides  recent  evidence  on  the  issue). 

The  legislative  decision  not  to  have  short-stay  outlier  policy  meant  that 
unusually  short-stay  patients  were  not  treated  as  qualitatively  different 
from  usual  patients.  Although  this  supported  a  program  emphasis  on 
incentives  for  efficiency,  it  missed  an  opportunity  to  pay  differently  for 
patients  admitted  for  observation  of  possible  disease  rather  than  for 
treatment. 

n.  DEFINING  NECESSARY  CARE 
The  acid  test  of  an  equitable  payment   system    is   that   it  pays   the   cost  of 
efficiently,  providing  necessary  care.     But  defining  what  care  is  necessary  is  not 
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simple  and  becomes  especially  formidable  when  the  payment  is  based  on  the  average 
cost  of  comparable  cases  rather  than  on  retrospective  review  of  whether  the  care 
actually  rendered  was  appropriate. 

A.  THE  SEVERITY  OF  ILLNESS  MODEL.  To  explain  the  difficulty  of  projecting 
necessary  care  from  information  about  the  patient,  we  will  start  by  considering  the 
"severity  of  illness"  modeL  This  model  rests  on  the  assumption  that  we  can  classify 
patients  into  categories  which  are  homogeneous  with  regard  to  all  characteristics, 
except  severity  of  illness,  and  that  we  can  then  determine  the  cost  of  necessary 
care  if  we  know  the  patient's  "severity  of  illness."  When  treatment  costs  vary 
among  patients  with  the  same  severity  of  illness  in  the  same  category,  the  severity 
model  attributes  those  differences  primarily  to  differences  in  efficiency  of  care  and 
does  not  make  allowance  for  other  sources  of  variation.  Although  this  model  is 
more  simplified  than  that  used  by  any  investigator  in  the  field,  it  highlights 
important  issues. 

1.  Defining  "Severity  of  Illness".  To  understand  this  model  we  need  first  to 
define  severity  of  illness,  whose  meaning  varies  in  four  ways. 

a.  Professional  Perspectives.  To  the  physician  caring  for  an  acutely  ill  patient, 
clinical  severity  of  illness  is  usually  a  measure  of  the  likelihood  of  irreversible  loss 
of  function,  failure  of  an  organ  system,  or  death.  To  nurses  caring  for  the  patient, 
and  to  family  and  social  service  agencies,  severity  may  also  denote  the  patient's 
total  chronic  impairment  and  dependency  on  others.  To  a  hospital  administrator, 
severity  may  mean  how  many  resources  will  be  needed  for  a  patient's  care. 

b.  Response  to  Treatment.  Some  investigators  include  the  patient's  response  to 
treatment  in  measuring  severity.  This  definition  implies  that  the  "true"  severity  of 
an  illness  is  reflected  in  whether  it  is  easily  treatable.  If  two  patients  with 
pneumonia  have  identical  clinical  findings  on  admission  to  a  hospital,  the  one  who 
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fails  to  respond  promptly  to  treatment  is,  from  this  perspective,  more  severely  ill 
than  the  one  who  responds  promptly.  Unfortunately,  if  we  measure  severity  after 
the  beginning  of  treatment,  we  risk  confounding  patient  condition  with  the 
effectiveness  of  care  rendered.  Most  investigators,  therefore,  agree  that  a  system 
which  could  predict  response  would  be  preferable  to  one  which  required  actual 
observation  of  the  patient's  clinical  course,  but  such  precise  predictions  are  beyond 
the  state  of  clinical  art  and  far  beyond  the  state  of  computer  based  systems  based 
on  abstracted  data.  The  next  chapter  describes  the  APACHE  system,  which  was 
designed  to  forecast  mortality. 

c.  Chronic  v.  Acute  Severity.  Some  investigators  focus  on  disease  progression 
as  a  chronic  condition  (e.g.,  Disease  Staging,  which  is  based  on  the  model  of  chronic 
diseases  such  as  cancer).  Others  focus  on  the  patient's  acute  condition  at  the 
moment  of  hospital  admission  (e.g.,  APACHE  and  MEDISGRPS).  The  two  models 
clearly  imply  different  strategies  for  description  and  measurement. 

d.  Treatable  v.  Untreatable.  Theoretically  we  might  wish  to  define  severity  as 
the  portion  of  the  patient's  illness  which  is  amenable  to  treatment  thus  excluding 
from  payment  those  components  of  severity  which  are  not  treatable.  Such  an 
approach  implies  kinds  of  triage  which  are  beyond  the  current  state  of  the  art  and 
may  not  be  desirable  in  some  cases,  but  the  idea  is  relevant  to  the  severity  model. 

e.  A  Definition.  In  this  report,  severity  of  illness  is  the  patient's  clinical 
condition  at  the  time  of  hospital  admission  as  indicated  by  the  clinical  likelihood  of 
adverse  outcomes  such  as  death,  permanent  impairment,  or  disability. 

2.  Limitations  of  the  Severity  Model.    Even  if  we  could  agree  on  a  definition  of 
severity,  the  severity  model  is  limited  by  variation  on  four  important  dimensions: 
—accepted  medical  practice, 

—the  reasons  why  patients  who  have  the  same  condition  are  admitted, 
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— variation  in  what  patients  want  and  need  from  their  health  care,  and 

—the  effects  of  death  on  costs. 
These  limitations  have  significant  payment  implications  and  lead  us  to  the  part    of 
the  congressional  mandate  which  concerns  intensity  of  care. 

B.  THE  INTENSITY  OF  CARE  OR  VARIATION  IN  PRACTICE  PATTERN  MODEL.  A 
major  limitation  of  the  severity  model  is  the  implicit  assumption  that  there  is  one 
way  to  treat  a  patient  with  a  particular  condition  at  a  particular  level  of  severity. 
This  implicit  assumption  suggests  that  there  is  consensus  on  a  common  set  of 
procedures  or  a  common  set  of  outcomes  which  can  be  expected.  In  fact, 
procedures,  expectations,  and  outcomes  vary  widely,  resulting  in  substantial 
variation  in  intensity  of  care  for  patients  with  similar  severity  of  illness. 

1.  Practice  Pattern  Variation.  The  most  dramatic  evidence  of  variation  in 
medical  practice  describes  the  actual  frequency  of  hospitalization.  Wennberg  and 
Gittlesohn  (1982)  and  Wennberg  et  al.,  (1984)  have  shown  that  rates  of  hospital 
admission  for  various  procedures  vary  widely  over  small  areas;  Chassin  et  al.  (1986) 
have  shown  comparable  variation  over  large  areas.  These  variations  have  not  been 
shown  to  reflect  either  inappropriate  medical  practice  or  inefficient  use  of 
resources.  Hospitalization  rates  are  sharply  lower  (Luft  1978,  1981)  for  patients  in 
prepaid  health  plans  compared  to  those  in  fee-for-service  settings,  and  this  is  not 
explained  by  selective  enrollment  (Manning  et  al.  1984).  Since  PPS  pays  only  for 
patients  who  have  been  admitted,  differences  in  admission  practices  might  not  be 
critical  to  cost  per  admission,  but  if  the  patients  needing  less  care  are  not  admitted, 
average  costs  of  other  admissions  may  be  higher.  GornicI<  (1982)  has  provided  some 
evidence  that  this  reciprocal  relationship  occurs. 

Mitchell  et  al.  (1985b)  provide  a  useful  example  of  practice  pattern  variation  by 
contrasting  treatment  for  similar  conditions  in  four  states.    Cases  which  fall  within 
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the  same  DRGs  vary  significantly  across  States  (an  issue  of  considerable  importance 
when  we  consider  the  relation  of  case  mix  evidence  to  national  rates  in  Chapter  4), 
but  the  differences  in  practice  patterns  appear  to  be  far  greater  than  variation  in 
case  severity,  lezzoni  and  Moskowitz  (1984)  used  different  methods  to  reach 
parallel  conclusions  in  comparing  New  Jersey  and  North  Carolina  data;  Cromwell 
et  al.  (1985)  show  similar  variation  across  hospital  types.  Variation  in  practice 
patterns  across  regions  is  well-documented,  and  the  shorter  stays  observed  in 
western  States  compared  to  eastern  States  is  well  known  (Chassin  1983,  Beebe  et  al. 
1985).  At  least  one  careful  study  (List  et  al.  1983),  has  shown  that  this  variation 
results  from  practice  pattern  variation  rather  than  patient  characteristics. 

The  problem  of  practice  pattern  variation  was  familiar  to  the  architects  of 
PPS,  and  the  decision  to  phase  in  national  rates  reflected  a  considered  conclusion 
that,  while  uniform  practice  patterns  were  not  necessary,  it  was  unreasonable  to 
continue  to  pay  different  amounts  in  for  essentially  comparable  cases  in  different 
regions.  But  such  variations  in  practice  patterns  conflict  with  the  severity  of  illness 
model  because  heterogeneity  of  practice  does  not  support  the  assumption  that  there 
is  a  single  medically-accepted  way  of  treating  a  patient  with  a  given  condition  and  a 
given  severity  level.  Practice  pattern  heterogeneity  also  makes  determining 
whether  patients  are  truly  comparable  very  difficult. 

Whether  Medicare  should  pay  for  different  practice  patterns  is  a  difficult 
question.  Few  would  argue  that  Medicare  should  pay  more  than  the  lowest  cost  if 
identical  patients  are  treated  at  different  costs  with  identical  outcomes;  Medicare 
should  not  pay  for  therapeutic  activity  which  has  no  benefit  to  the  patient.  But 
there  are  few  areas  of  medicine  where  patient  care  can  be  based  on  unequivocal 
outcome  studies.  The  picture  is  further  obscured  because  variations  in  reason  for 
admission  and  goals  of  treatment  are  often  mistaken  for  variation  in  practice  styles. 
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2.  Variation  in  Reason  for  Admission.  Patients  with  identical  conditions  and 
identical  severity  of  illness  may  be  admitted  for  sharply  different  reasons.  The 
admitting  physician's  intent  is  often  independent  of  clinical  condition  and  reflects 
such  issues  as: 

a.  Diagnostic  v.  Therapeutic  Intent.  Some  patients  are  admitted  to  a 
hospital  after  the  diagnosis  is  established  while  others  are  admitted  for 
diagnostic  evaluation.  The  procedures  performed  can  indicate  this  difference  in 
some  cases,  which  may  affect  DRG  assignment.  For  example,  if  a  cystoscopy  is 
not  performed  during  an  admission  when  a  patient  has  a  prostatectomy,  it  is 
likely  that  one  was  performed  prior  to  admission  and  that  the  diagnosis  was 
established  before  admission.  In  the  case  of  coronary  artery  bypass  grafting, 
patients  are  assigned  to  different  DRGs  according  to  whether  a  cardiac 
catheterization  was  performed  during  the  same  admission.  A  catheterization 
suggests  that  the  diagnosis  was  not  fully  established  before  the  admission,  but 
absence  of  a  catheterization  does  not  indicate  whether  the  diagnosis  was 
established  prior  to  admission  or  by  other  procedures  during  the  admission.  In 
general,  emergency  admissions  are  likely  to  require  inpatient  diagnostic 
evaluation  and  to  have  higher  costs  (Munoz  et  al.  1986). 

b.  Complete  v.  Staged  Treatment.  The  exampl^d*  of  staged  treatment 
which  has  been  most  intensively  discussed  for  DRGs  is  the  contrast  between 
two  admissions  and  one  admission  for  bilateral  major  joint  surgery;  this  problem 
has  been  resolved  by  creating  a  new  DRG  for  multiple  procedures.  But  similar 
problems  exist  for  many  other  conditions,  such  as  whether  to  operate  on  a 
patient  whose  cholecystitis  has  been  treated  or  whether  to  return  that  patient 
for  surgery  at  a  later  date  (Gertman  and  Lowenstein  1984). 

c.  Palliative  v.  Curative  treatment.  The  choice  between  palliative  and 
curative  treatment  is  one  of  the  most  sensitive  in  clinical  medicine,  involving 
subjective  and  value-laden  judgments.     Some  patients   wish   heroic    measure; 
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others  do  not.  Bedell  and  Delbanco  (1984)  found,  for  example,  that  8  of  24 
patients  who  had  been  successfully  resuscitated  after  a  cardiopulmonary  arrest 
said  that  they  had  not  wished  resuscitation  and  would  not  wish  it  in  the  future. 
Patients  choosing  between  alternative  cancer  therapies  make  similar  decisions. 
McNeil  et  al.  (1982)  have  shown  that  patient  decisions  are  highly  sensitive  to 
the  nuances  of  the  way  the  physician  presents  the  decision.  Issues  of  this  kind 
appear  to  be  beyond  the  reach  of  any  severity  measurement  system. 

d.  Operative  Procedures.  Objective  data  tend  to  capture  well  whether  a 
patient  should  be  treated  surgically  or  medically,  which  is  one  reason  why 
surgeons  rather  than  computers  make  such  decisions.  Even  if  such  data  capture 
were  possible  at  a  point  in  time,  the  criteria  would  probably  be  dated  by  the 
time  they  could  be  put  in  place  for  Medicare  payment.  Here  we  note  only  that 
predicting  whether  surgery  is  necessary  from  admission  severity  appears 
impractical.  In  Section  III.B,  we  will  discuss  the  problems  raised  by  determining 
payments  from  procedures  performed. 
3.  The  Effects  of  Death  on  Costs. 

a.  Patients  Who  Die  In  Hospitals.  A  number  of  studies  have  now  shown 
that  the  most  seriously  ill  patients  do  not  necessarily  consume  the  largest 
amount  of  resources — some  die  relatively  quickly  an3  receive  little  treatment 
(Conklin  et  al.,  1984a,  Conklin  1985;  Horn  et  al.,  1984b;  Wagner  and  Draper 
1984).  The  DRG  system  recognizes  this  effect  in  myocardial  infarction,  where 
a  special  DRG  for  patients  who  die  receives  somewhat  lower  payment  than 
DRGs  for  patients  who  live.  If  death  could  be  predicted  accurately,  this  effect 
would  not  confound  the  effort  to  predict  costs  from  severity  of  illness,  but 
these  severely  ill  patients  tend  to  be  extremely  expensive  if  they  live,  causing 
serious  predictive  problems. 
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b.  Where  Patients  Die.  Many  patients  do  not  die  in  hospitals  because 
they,  their  physicians,  or  the  health  care  system  elect  to  discharge  them  to 
other  settings  before  they  die.  Hospices  are  only  one  example  of  health  care 
system  arrangements  which  promote  such  discharges.  Patterns  vary  widely 
from  State  to  State  (Chapter  4. Ill)  and  presumably  reflect  both  patient 
preferences,  and  physician  judgment  as  to  the  circumstances  which  can  best 
combine  good  medical  care  with  a  supportive  environment. 

4.  Severity  and  Cost  of  Care.  These  reservations  regarding  the  severity  model 
may  appear  somewhat  academic  to  readers  who  are  convinced  that,  nevertheless, 
severity  differences  obviously  account  for  most  variation  in  cost  of  care.  Although 
available  data  do  suggest  that  severity  of  illness  is  an  important  source  of  variation 
in  cost  per  case  within  hospitals  (Horn  and  Sharkey  1983;  Horn  et  al.,  1984b;  Horn  et 
al.,  1985a, b;  Brewster  et  al.,  1985b,  Wagner  and  Draper  1984)  and  between  hospitals 
of  a  similar  type  (Horn  et  al.,  1984,  1985a,b),  there  is  little  evidence  that  severity 
differences  explain  the  majority  of  differences  in  costs  between  hospital  types.  In 
Chapter  3.  VII,  we  will  see  that  such  "obvious"  predictors  of  severity  as  poverty, 
disability,  extreme  age,  and  previous  hospitalization  do  not,  in  fact,  predict  higher 
costs  once  DRG  and  hospital  are  taken  into  account. 

C.  SOURCES  OF  VARIATION  IN  COST  OF  CARE  FOR  WHICH  MEDICARE  DOES 
AND  DOES  NOT  PAY.  In  principle.  Medicare  pays  only  for  causes  of  cost  variation 
beyond  the  hospital's  control.  To  clarify  the  concept  of  needed  care,  we  will 
consider  seven  sources  of  cost  variation.  Of  these.  Medicare  agrees  to  pay  for 
three,  seeks  not  to  pay  for  one,  and  has  no  explicit  policy  regarding  three: 
1.  Accepted  Sources  of  Variation.  Medicare  accepts  responsibility  for: 
a.  Factor  Prices.  These  are  the  prices  which  the  hospital  must  pay  for  labor, 
capital,  and  house  staff  services.     As  noted  above,  however.   Medicare  implicitly 
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assumes   that   prices   of  supplies   and   equipment,   and   total  costs  for  energy  and 
security  are  uniform  across  hospitals. 

b.  Referral  Patterns,  Patients  are  referred  to  hospitals  and  transferred 
between  hospitals  so  that  some  hospitals  have  case  mixes  which  are  different  from 
those  of  other  hospitals.  Some  of  this  variation  is  well-described  by  DRGs  (e.g., 
specialization  by  some  hospitals  in  certain  kinds  of  surgery),  but  DRGs  may  not 
capture  costs  when  referrals  result  from  diagnostic  difficulty  or  severity  of  illness, 
which  may  be  the  case  for  teaching  and  referral  hospitals. 

c.  Demographic  Variation  in  Patterns  of  Disease.  Medicare  would  accept 
variations  in  costs  which  result  from  differences  in  kinds  and  severity  of  illness 
across  hospitals  and  regions. 

2.  Rejected  Sources  of  Variation.    Medicare  seeks  not  to  pay  for: 

a.  Inefficiency  of  Care.  One  of  the  major  purposes  of  PPS  is  to  give  hospitals 
incentives  to  become  more  efficient.  Medicare  hopes  that  PPS  will  specifically  not 
pay  for  differences  in  efficiency.  Inefficiency  includes  unnecessary  admissions, 
unnecessary  inpatient  days,  and  unnecessary  services  during  an  admission  as  well  as 
other  kinds  of  inefficiency. 

3.  Ambiguous  Issues.    Medicare  policy  is  ambiguous  on  paying  for: 

a.  Variation  in  Available  Support  Systems.  Medicare  seeks  not  to  pay  for 
hospital  care  which  is  required  because  of  the  absence  of  community  resources  such 
as  nursing  homes;  to  pay  for  such  care  would  encourage  communities  inefficiently  to 
substitute  inpatient  services  covered  by  Medicare  for  less  expensive  services  which 
are  often  not  covered.  On  the  other  hand.  Medicare  does  pay  for  "administratively 
necessary"  days  of  care  and  this  care  was  built  into  DRG  weights  and  payments. 

b.  Admission  Threshold  Variation.  Medicare  clearly  refuses  to  pay  for 
unnecessary    admissions.      But    we   have    noted    that    the    threshold    for  necessary 
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admissions  appears  to  vary  among  physicians  and  across  regions  and  that  this 
variation  may  affect  both  total  costs  and  cost  per  admission.  PROs  review 
admission  thresholds,  but  there  is  no  national  standard. 

c.  Practice  Variations.  There  are  differences,  well  within  the  range  of 
acceptable  medical  practice,  in  what  different  physicians  do  for  patients  with 
identical  conditions.  We  do  not  know  the  extent  to  which  variation  in  treatment 
activity  reflects  differences  in  goals,  such  as  cure  versus  palliation  or  diagnosis 
versus  treatment.  As  noted,  Medicare  would  seek  not  to  pay  for  practice  variations 
which  have  no  effect  on  outcome. 

D.  USE  OF  ACTUAL  PAYMENTS.  In  practice  there  is  no  gold  standard  for 
measuring  need  for  care,  and  investigators  have  widely  accepted  the  use  of  actual 
cost  of  care  as  a  kind  of  silver  standard.  Only  a  very  small  number  of  studies  have 
sought  to  go  beyond  this  standard  (Young  1985;  Strumwasser  and  Pararipe,  1985; 
Systemetrics  1983;  Kay  et  al.,  1986;  Gertman  and  Bucher  1971;  Gertman  and 
Restuccia  1981;  Restucccia  et  al,  1984)  and  none  has  examined  whether 
classification  systems  predict  necessary  care  as  well  or  better  than  they  predict 
care  rendered.  Indeed,  many  investigators  have  been  unable  to  obtain  cost 
information  and  have  had  to  settle  for  charges  or  actual  length  of  stay.  Although 
Cotterill  et  al.  (1985)  have  shown  that  charges  can  be  accurately  substituted  for 
costs  in  some  cases,  charges  cannot  be  used  to  assess  whether  payments  reflect 
costs.  For  the  remainder  of  this  report,  we  will,  therefore,  be  dealing  almost 
exclusively  with  costs  of  care  actually  rendered  rather  than  with  necessary  care,  but 
the  reader  should  keep  in  mind  that  this  is  a  compromise.  The  importance  of  this 
distinction  will  become  clearer  in  Chapter  4.1  when  we  discuss  the  kinds  of  cost 
variations  for  which  PPS  does  not  pay  because  they  are  thought  to  be  unnecessary. 
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III.  PREDICTORS  OF  THE  COST  OF  NECESSARY  CARE:  THE  INDEPENDENT  VARIABLES 
In  this  section  we  examine  four  issues  in  choosing  variables  to  predict  the  cost 
of  an  admission  or  to  determine  the  amount  of  a  payment: 

The  implications  of  variable  choice  for  medical  practice, 
The  choice  of  predictive  variables  versus  variables  which  describe  what 
was  done, 

The  administrative  feasibility  of  using  different  variables,  and 
The  basic  problems  in  design  of  a  system  which  fully  specifies  necessary 
care. 
At  the  end  of  Chapter  3,  after  examining  data  on  alternative  case  mix  measures,  we 
will  consider  what  variables  are  most  credible. 

A.  THE  CONCEPT  OF  NEUTRALITY;  HOW  THE  CHOICE  OF  PREDICTOR 
VARIABLES  CAN  SHAPE  CLINICAL  PRACTICE.  The  choice  of  predictor  variables 
in  a  case  mix  system  is  extremely  important  in  determining  not  only  the  power  and 
administrability  of  a  system  but  also  its  impact  on  clinical  practice.  The  choice  of 
independent  variables  expresses  the  philosophy  of  the  system  as  to  what  sources  of 
variation  in  cost  of  care  are  legitimate,  and  the  choice  directs  a  system  clinically 
and  socially.  A  key  concept  in  the  analysis  which  follows  is  that  including  a  variable 
in  a  case  mix  measurement  system  tends  to  make  the  system  neutral  with  regard  to 
that  variable. 

1.  Fiscal  Neutrality  to  Clinical  Decision-Making.  A  system  which  neither 
rewards  nor  penalizes  a  hospital  for  performing  a  procedure  is  fiscally  neutral  to 
that  procedure.  For  example,  DRGs  use  major  procedures  to  classify  patients.  If 
reimbursements  are  properly  calculated  for  the  DRGs  with  and  without  surgery,  the 
difference  in  reimbursement  is  the  cost  of  surgery,  all  other  things  being  equal. 
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Thus,  the  hospital  is  neither  rewarded  nor  penalized  for  performing  surgery  and  the 
provider  is  free  to  make  this  decision  on  clinical  grounds.  Coffey  and  Goldfarb 
(1984)  report  that,  while  teaching  and  nonteaching  hospitals  in  Maryland  have  very 
similar  case  mix  as  described  by  Disease  Staging,  teaching  hospitals  have  more 
complex  case  mixes  as  described  by  DRGs.  This  difference  occurs  because  teaching 
hospitals  do  more  procedures  for  patients  at  a  given  stage  of  illness,  and  DRGs  use 
procedures  to  classify  a  patient  while  Staging  does  not.  One  might  argue  that  this 
increased  frequency  of  procedures  represents  overutilization  and  should  be 
discouraged  or  one  might  argue  that  it  represents  appropriate  concentration  of 
specialized  procedures  in  specialty  centers  and  should  be  treated  with  neutrality.  If 
one  adopted  the  former  view,  one  would  (if  Staging  and  DRGs  are  otherwise  equal) 
adopt  Staging  for  case  mix  classification  because  Staging,  by  not  using  procedures  to 
classify  patients,  creates  an  incentive  for  hospitals  not  to  incur  the  costs  of  surgery. 
If  we  adopted  the  latter  view,  we  would  use  DRGs  for  case  mix  classification  since 
DRGs  are  neutral  to  performance  of  procedures  and,  therefore,  do  not  penalize  a 
hospital  for  becoming  a  surgical  referral  center. 

The  same  argument  can  be  applied,  for  example,  to  whether  hospitals  treat 
psychiatric  patients  in  more  expensive,  specialized  units.  If  such  treatment  is 
useful,  PPS  should  consider  such  care  a  procedure  and  use  it  to  classify  patients  for 
payment.  If  special  units  are  wasteful,  PPS  should  not  use  care  in  special  units  for 
classification.  This  would  create  strong  economic  incentives  to  hospitals  not  to 
operate  specialized  units.  These  two  examples  suggest  that  the  problem  of 
neutrality  is  a  far-reaching  issue  since  it  raises  the  problem  of  the  extent  to  which 
Medicare  will  pay  for  treatments  which  are  more  expensive  but  more  effective,  for 
more  expensive  treatments  whose  efficacy  is  unproven,  and  for  patient  and 
physician  preferences.  To  the  extent  that  higher  intensity  of  care  is  not  recognized 
by  the  payment  system,  intensity  of  care  is  likely  to  diminish. 
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2.  Neutrality  to  Choice  of  Patients.  Chapter  1  identified  equity  to 
beneficiaries  as  a  critical  element  in  the  equity  of  PPS.  To  assure  such  equity,  FPS 
should  give  providers  no  incentive  to  prefer,  or  to  avoid,  certain  classes  of 
beneficiaries.  Such  incentives  might  exist  if,  for  example,  admissions  from  nursing 
homes  were  more  expensive  than  admissions  from  other  sources.  The  incentive 
could  be  neutralized  by  making  source  of  admission  a  variable  in  determining 
payment,  so  that  when  such  increased  costs  were  predictably  present  they  would  be 
recognized  by  appropriate  payments  (see  Chapter  3. VIII. D). 

3.  Neutrality  to  Diseases.  Neutrality  to  diseases  across  classes  of  providers 
may  be  a  significant  issue.  Cretin  et  al.  (1986)  provide  evidence  that  DRGs  which 
are  profitable  in  some  classes  of  hospitals  may  be  unprofitable  in  others.  If 
differences  in  case  mix  or  in  appropriate  treatments  explain  these  differences,  the 
resulting  incentives  may  be  undesirable  . 

4.  Neutrality  Regarding  Exceptional  Cases.  A  system  which  uses  outlier  status 
as  a  payment  variable  also  becomes  more  neutral  toward  these  exceptional  patients. 
To  the  extent  that  Medicare  wishes  hospitals  to  be  willing  to  take  on  exceptionally 
expensive  patients,  an  outlier  payment  policy  is  useful.  But  to  assure  true 
neutrality,  such  a  policy  might  need  to  pay  a  larger  portion  of  costs  than  would  be 
consistent  with  the  incentives  of  PPS.  Chapter  4. VI  examines  this  problem  in  detail. 
B.  ASSESSING  PATIENTS  AT  ADMISSION  v.  ASSESSING  THROUGHOUT  THE 
STAY. 

Differences  in  time  perspective  in  the  definition  of  severity  of  illness  contribute  to 
an  important  controversy  in  predicting  necessary  services:  can  a  prospective 
payment  system  legitimately  use  clinical  events,  procedures,  or  actual  costs  and 
charges  occurring  during  the  course  of  a  hospitalization  as  indicators  of  patient 
needs? 
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1.  Using  Clinical  Events.    The  use  of  events  occurring  during  hospitalization  to 

make  case  mix  adjustments,  and  ultimately  to  determine  payment,  arouses  intense 

feelings  among  program  planners  and  theorists  for  two  reasons: 

Using    events    occurring    during    hospitalization    to    determine    paym.ent 
suggests  a  reversion  to  cost  based  reimbursement,  and 
Considering  patient   condition  after   admission  as  a  basis   for  payment 
suggests  rewarding  ineffective  care. 
From  this  perspective,  it  is  useful  to  realize  that  PPS  already  considers  such 

data  in  three  important  ways: 

Operative  procedures  are  used  for  making  the  most  important 
classification  decisions  in  the  DRG  system,  and  there  is  evidence  that 
they  account  for  most  of  the  explanatory  power  of  the  DRG  system; 
Complications  occurring  during  hospitalization  are  used  to  make  further 
classification  decisions;  in  general,  PPS  pays  more  for  patients  who 
develop  complications;  and 

Law  prescribes  paying  marginal  cost  beyond  the  outlier  threshold. 
To   understand    the    implications   of   using   intrahospital    events    to    determine 

payments,   we  should  distinguish  between  paying   for  the  performance  of  clinical 

activities,   such  as  surgery,   and   using   certain   procedures   as   indicators   that   the 

patient  is  very  ill  and  merits  higher  payment. 

a.  Paying  for  Surgery.   The  fact  that  surgery  is  performed  can  permit 

a  significant  number  of  inferences  about  the  patient  if  the  principal  procedure  is 

typically  elective: 

the  diagnosis  was  known  prior  to  admission 

the  patient's  condition  on  admission  was  good  and  suitable  for  elective 

surgery 
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the  purpose  of  the  admission  was  to  perform  surgery 

the  resources  needed  will  be  largely  dictated  by  the  procedure  performed. 
Consistent  v.'ith  this,  a  number  of  investigators  have  found  that  surgical  DRGs  have 
far  less  internal  variation  than  nonsurgical  DRGs  (Mitchell  et  al.  1984,  1985a, b; 
West  et  al.  1985;  Frank  and  Lave  1985).  The  procedures  performed  can  also  provide 
Evidence  of  the  variation  in  therapeutic  intent  among  patients  who  may  appear 
identical  at  admission,  a  kind  of  variation  which  severity  of  illness  measurements 
are  unlikely  to  capture.  This  strategy  also  provides  neutrality  to  performance  of 
surgery,  which  is  important  for  surgical  referral  centers.  Since  reviewing  the 
clinical  appropriateness  of  major  surgery  is  feasible,  paying  for  surgery  provides  a 
substantial  reward  in  accuracy  at  the  cost  of  a  rather  modest  step  toward  cost- 
based  reimbursement.  Using  operative  procedures  as  a  measure  of  need  is,  however, 
troublesome  because  wide  regional  variation  in  frequency  of  procedures  raises  a 
question  about  the  relation  of  actual  procedures  to  necessary  procedures  (Chassin  et 
al.  1986;  Wennberg  et  al.  1984;  McPherson  et  al.  1982). 

b.  Paying  for  Indicator  Procedures.  A  more  serious  hazard  rises  when 
procedures  are  taken  as  indicators  of  severity  and  the  payment  rises  far  more  than 
the  cost  of  performing  the  procedure.  This  can  create  a  significant  incentive  to  the 
hospital  to  perform  unnecessary  procedures  because  the  payment  is  "leveraged,"  and 
certainly  provides  no  incentive  to  restrain  use  of  these  "indicator  services." 
Another  commonly  cited  risk  is  that  a  system  which  considers  care  actually 
rendered  rewards  poor  care  which  results  in  resource-consuming  complications. 
Although  this  is  at  least  theoretically  true,  there  are  so  many  pressures  to  avoid 
such  complications  (including  malpractice  litigation,  hospital  quality  assurance 
systems,  and  PROs)  that  it  may  not  be  strategically  fruitful  to  attempt  to  address 
this    problem    through    the    payment    system.       On    the    other    hand,    rewards    for 
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inefficient  care  are  a  legitimate  source  of  further  concern,  and  a  system  which 
takes  failure  to  improve  as  evidence  of  severity  may  have  misclassified  ineffective 
treatment,  which  should  be  penalized  rather  than  rewarded. 

3.  Using  Actual  Costs  or  Charges.  An  important  approach  to  case  mix 
classification  is  the  use  of  actual  costs  or  charges.  PPS  uses  actual  charges  for 
medically  necessary  services,  discounted  to  approximate  marginal  costs,  to 
determine  payments  for  cost  outliers  (Chapter  4).  The  outlier  system  illustrates 
how  actual  charges  or  costs  can  be  used  without  destroying  the  basic  incentives  of 
PPS:  the  substantial  loss  a  hospital  incurs  before  reaching  the  outlier  threshold 
preserves  the  incentives.  Nevertheless,  even  after  PRO  review  of  medical 
necessity,  actual  costs  or  charges  remain  only  estimates  of  efficiently  provided 
necessary  care. 

4.  Use  of  Services  and  Equity.  Determining  payment  from  services  rendered 
involves  a  tradeoff  between  two  kinds  of  equity.  On  the  one  hand,  considering 
actual  services  increases  equity  for  unusual  cases  which  cannot  be  predicted  and  for 
hospitals  receiving  difficult  referrals.  On  the  other  hand,  using  actual  services 
risks  overpaying  hospitals  which  practice  inefficient  or  counterproductive  medicine. 
C.  THE  CONCEPT  OF  A  "FULLY-SPECIFIED"  SYSTEM.  A  "fully-specified"  system 
would  use  all  available  medical  knowledge  to  define  necessary  services  for  a 
condition  and  would  then  determine  payment  from  the  cost  of  efficiently  providing 
these  services.  Attractive  though  such  an  idea  may  be,  even  the  goal  presents  three 
important  problems: 

1.  System  Development.  Developing  and  testing  such  a  system  might  well 
demand  a  larger  health  services  research  establishment  than  can  be  expected  to 
exist  in  the  United  States  during  the  duration  of  PPS. 
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2.  System  Maintenance.  Keeping  the  DRG  systenn  current  with  technology 
already  puts  unprecedented  demands  on  HCFA's  rulemaking  process;  extending  this 
process  to  the  full  range  of  medical  care  would  multiply  these  problems.  In 
addition,  continual  modifications  of  data  reporting  rules  would  put  severe 
administrative  strains  on  everyone  involved  with  HCFA  patients. 

3.  Appearance  of  Regulation.  DRGs  are  not,  and  were  not  intended  to  be, 
prescriptions  for  length  of  stay  or  value  of  services  to  be  provided  for  each  patient. 
But  some  hospital  managers  have  nevertheless  interpreted  mean  lengths  of  stay  as 
maximum  lengths  of  stay  and  interpreted  case  reimbursement  levels  as  caps  on  per 
case  expenditures.  This  practice  is  clearly  driven  by  profit  motives  or  confusion 
rather  than  by  respect  for  the  intention  or  the  content  of  Medicare  rules.  Per -case 
payments  based  on  a  detailed  analysis  of  what  should  be  done  for  patients  with  a 
particular  condition  might  also  be  interpreted  as  a  Medicare  cookbook  for  providing 
care.  This  criticism  has  already  been  raised  about  Young's  (1985)  design  of 
normative  weights  for  Patient  Management  Categories  (PMCs)  (Chapter  3.V). 

In  summary,  a  fully-specified  system  does  not  seem  to  be  a  realistic  or 
desirable  goal;  PPS  will  continue  to  depend  on  averages. 

IV.  EVALUATING  DRGs 
In  this  section  we  provide  a  detailed  evaluation  of  DRGs  as  a  tool  for  predicting 
costs  of  care.  In  the  next  chapter,  this  evaluation  of  DRGs  will  also  serve  as  a 
model  for  our  evaluation  of  alternative  systems  (summarized  in  Table  2.1).  Here, 
we  describe  how  DRGs  classify  cases,  the  variants  of  DRGs  which  have  been  used, 
the  face  validity  of  the  system,  its  data  requirements,  the  publicly  available 
documentation,  problems  of  subjectivity  and  vulnerability  to  manipulation,  the 
reliability  of  DRGs,  and  finally  their  ability  to  predict  necessary  care. 
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A.     CLASSIFICATION    STRUCTURE:    NONORDINAL    AND    ORDINAL    SYSTEMS. 
Case  mix  systems  can  take  two  major  forms. 

1.  Nonordinal  Systems.  DRGs,  Disease  Staging,  and  PMCs  create  a  large 
number  of  categories  which  are  clinically  distinct.  CSI  creates  such  categories,  but 
only  as  a  route  to  a  final  severity  score  which  is  intended  to  be  comparable  access 
categories.  Cases  in  each  category  are  expected  to  require  similar  clinical 
management  and  resources  but  calibration  with  actual  data  is  necessary  to  rank 
categories  from  most  to  least  costly  or  severe. 

2.  Ordinal  Systems.  The  APACHE  score  (APACHEII),  Medical  Severity  Groups 
MEDISGRPS,  son  and  CSI  assign  a  single  score  to  each  patient,  which  measures  how 
sick  the  patient  is;  Staging  assigns  a  stage  within  the  disease  category.  The  score 
orders  patients  from  least  sick  to  most  sick.  APACHE,  MEDISGRPS,  SOU  and  CSI 
scores  are  generally  used  in  conjunction  with  a  disease  oriented  classification 
system. 

Though  an  ordinal  system  which  produces  a  single  severity  score  appears  to  be 
very  simple,  the  effective  number  of  categories  multiplies  when  the  severity  score 
is  combined  with  diagnostic  categories  for  payment  purposes.  DRGs,  with  468 
categories,  are  actually  the  most  parsimonious  of  all  the  systems  we  shall  examine. 
For  payment  purposes,  the  data  available  to  determine  payment  rates  for  each 
category  are  the  only  limit  to  the  number  of  categories. 

Both  ordinal  and  nonordinal  systems  (PMCs  and  SOU)  may  be  unable  to  assign 
some  patients;  a  nonordinal  system  may  assign  patients  with  multiple  diagnoses  to 
multiple  categories  (PMCs,  Staging).  If  a  system  cannot  assign  a  patient,  there  must 
be  a  fallback  strategy  for  payment;  if  it  makes  multiple  assignments,  there  must  be 
a  rule  to  determine  payment  for  patients  with  multiple  assignments.  DRGs  classify 
all  patients  and  assign  each  patient  to  only  one  category. 

The   DRG   system    was  developed  at   Yale   University  by  John  Thompson  and 
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Robert  Fetter.  Health  Systems  International  holds  the  current  HCFA  contract  for 
updating  the  DRG  classification  program.  The  update  system  includes  refinements 
of  DRG  definitions  (e.g.,  HCFA  recently  changed  the  way  in  which  patients  can  be 
classified  in  the  kidney  transplant  DRG),  creation  of  new  DRGs  (e.g.,  HCFA  created 
a  new  DRG  for  multiple  joint  procedures),  and  restructuring  of  whole  groups  of 
DRGs  (e.g.,  the  entire  group  of  substance  abuse  DRGs  was  recently  restructured). 
Averill  (1986)  describes  HCFA's  current  procedure  for  updating  DRGs.  DRGs  are 
now  used  by  Medicare  and  a  number  of  other  payors  as  a  basis  for  payment  and  are 
the  only  classification  system  in  use  for  payment. 

The  DRG  system  has  a  "tree"  structure.  The  system  classifies  a  case  into  one 
of  23  Major  Diagnostic  Categories  (MDCs)  on  the  basis  of  the  principal  diagnosis. 
Usually,  MDCs  are  next  divided  into  medical  and  surgical  cases.  Finally,  medical 
and  surgical  cases  are  assigned  to  a  DRG  on  the  basis  of  presence  or  absence  of 
certain  procedures,  age  of  the  patient,  specific  principal  diagnosis,  presence  or 
absence  of  a  significant  comorbidity  or  complication,  and,  in  a  few  cases,  whether 
the  patient  died,  was  discharged  against  medical  advice,  or  was  a  transfer  from 
another  hospital.  There  are  ^71  DRGs,  including  DRG  ^68  for  cases  with  surgery 
that  is  inconsistent  with  the  principal  diagnosis  and  several  additional  DRGs  have 
been  proposed.  The  DRGs  were  developed  using  clinical  judgment  in  concert  with 
statistical  analysis  of  a  large  number  of  hospital  discharges. 

B.  VARIANTS.  All  of  the  systems  described  in  this  report  are  still  being  developed, 
usually  by  the  originator  but  sometimes  by  others  as  well.  In  consequence,  a  test  of 
one  version  of  the  system  may  not  apply  to  other  versions,  and  a  limitation  in  one 
version  may  not  be  present  in  another.  Different  versions  usually  constitute  a 
developmental  sequence,  but  in  some  systems  (Staging,  SOU)  versions  which  are 
being   developed   along   different   lines  are   incompatible   with   one  another.      The 
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existence  of  these  variant  versions  complicates  the  evaluation  of  the  case  mix 
systems. 

Due  in  part  to  its  relatively  long  history  and  in  part  to  its  actual  use  in  the  New 
Jersey  and  Medicare  PPSs,  the  DRG  system  exists  in  many  versions;  some,  written 
for  coding  systems  previous  to  ICD-9-CM  (Commission  on  Professional  Hospital 
Activities  1978),  are  primarily  of  historical  interest.  We  will  present  only  data  from 
the  1CD-9-CM  grouper  and  will,  when  possible,  distinguish  among  the  sequence  of 
revisions  which  HCFA  has  made  since  1982. 

C.  FACE  VALIDITY.  Face  validity  is  the  apparent  reasonableness  of  the  data  which 
the  system  uses  to  make  classifications  (to  be  discussed  further  in  Chapter  3.VIII) 
and  the  way  the  data  are  combined  to  predict  necessary  care.  There  are  five 
significant  face  validity  issues  for  DRGs: 

1.  Data  Limited  to  UHDDS.  We  noted  that  adopting  DRGs  limited  PPS  to  using 
UHDDS  data  items  and  that  these  contain  almost  no  direct  evidence  as  to  the 
severity  of  the  illness  and  very  little  as  to  its  acuity. 

2.  Using  a  Uniform  List  of  Comorbidities/Complications  for  all  MDCs.  When  a 
uniform  list  is  used,  a  secondary  diagnosis  which  is  no  more  than  a  manifestation  of 
the  principal  diagnosis  may  be  interpreted  as  a  comorbidity— for  example,  angina  is 
a  comorbidity  if  a  patient  has  pneumonia  but  only  a  feature  of  the  disease  if  the 
principal  diagnosis  is  coronary  atherosclerosis.  This  contrasts  with  the  careful 
search  for  patterns  of  diagnosis  which  we  will  see  in  PMCs  and  Staging.  Further,  all 
comorbidities  have  equal  weight — candidal  otitis  externa  is  given  the  same 
importance  as  candidal  meningitis. 

3.  Use  of  Procedures  for  Classification.  Using  procedures  for  classification  is, 
as  noted  above,  a  kind  of  cost-based  reimbursement. 
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4.  Categories  Limited  to  About  500.  As  we  have  noted,  this  limitation  restricts 
the  ability  of  DRGs  to  deal  with  unusual  conditions  even  when  they  can  be  clearly 
identified. 

D.  AVAILABILITY  OF  DOCUMENTATION.  Systems  differ  in  the  degree  to  which 
they  are  in  the  public  domain  and  the  degree  to  which  they  are  open  for  public 
inspection  and  research.  There  are  several  ways  in  which  documentation  can  be 
unavailable: 

A  system   may  be  copyrighted  and  the  owners  may  refuse  to  allow  any 

dissemination  of  the  key  materials  (e.g.,  MEDISGRPS). 

A  systein   may  be  embodied  in  a  complex  computer  program  for  which 

there  is  little  documentation  (e.g.,  PMCs). 

A  system  may  require  special  training  of  data  coders  for  which  no  training 

manual  is  publicly  available  (e.g.,  SOU). 
In  each  case,   the  ability  of  independent  investigators  to  study  the  reliability  or 
performance  of  the  system   is  limited,  and  potential  users  are  forced  to  rely  for 
information  on  parties  who  have,  at  least,  an  emotional  investment  in  the  system 
and  often  a  large  personal  financial  stake  in  its  adoption. 

Extensive  documentation  about  the  DRG  system  is  in  the  public  domain.  In 
addition  to  published  articles  (Fetter  et  aL  1980)  and  final  reports  of  grant  projects 
(Fetter  et  al.  1982),  the  PPS  system  itself  is  documented  in  the  Federal  Register 
(1983,  1984,  1985).  Detailed  descriptions  of  the  workings  of  the  DRG  assignment 
algorithm  and  a  computer  program  to  implement  the  algorithm  are  available  from  a 
private  company  (Averill  et  al.  1983). 

E.  DATA  REQUIREMENTS.  All  hospitals  must  provide  the  UHDDS  (see  Section  I.A 
of  this  chapter)  for  each  Medicare  discharge  and  this  information  is  available  in 
computerized  form.     Some  systems  can  classify  patients  using  UHDDS  data  alone, 
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others  require  data  which  are  readily  abstracted  from  the  medical  record,  and 
others  require  data  which  can  only  be  identified  by  specially-trained  abstractors.  A 
system  using  UHDDS  imposes  no  data  collection  burdens  and  data  to  calculate 
payment  weights  for  such  a  system  are  readily  available.  DRGs  can  operate 
completely  from  UHDDS. 

Theoretically,  the  burdens  of  a  system  requiring  special  data  collection  could  be 
reduced  by  collecting  data  on  a  sample  of  cases  within  each  hospital  rather  than  all 
cases,  but  the  feasibility  of  this  strategy  cannot  be  estimated  from  available  data 
for  any  of  the  systems  which  require  special  data  collection. 

F.  SUBJECTIVITY,  VULNERABILITY  TO  MANIPULATION,  AND  RELIABILITY. 
Subjectivity  and  vulnerability  to  manipulation  are  not  identical  to  unreliability. 
Unreliability  means  only  that  the  system  does  not  always  make  the  same  assignment 
for  the  same  case;  even  abstracting  a  number,  such  as  a  patient's  temperature,  is 
subject  to  error  and  can  lead  to  unreliability.  But  subjectivity  means  that  the 
abstractor  must  make  a  judgment  for  which  rules  are  not  absolutely  clear,  so  that 
subjectivity  creates  a  risk  of  manipulation  (conscious  or  unconscious)  by  abstractors 
with  differing  goals.  Subjectivity  can  be  discerned  simply  by  examining  the  way  the 
system  is  put  together,  while  we  must  measure  actual  performance  of  the  system  to 
determine  reliability. 

1.  Subjectivity.  The  danger  of  subjectivity  is  that  a  measure  which  is  reliable  in 
the  hands  of  abstractors  with  shared  goals  may  still  be  highly  vulnerable  to 
manipulation.  In  research,  abstractors  may  consciously  or  unconsciously  use  the 
subjectivity  of  the  system  to  make  coding  predict  costs  more  accurately  and  thus  to 
inflate  the  system's  apparent  performance.  In  payment  policy,  the  important 
question  is  whether  an  abstractor  who  seeks  to  maximize  Medicare  payments  (e.g., 
one  working  for  the  hospital)  will  code  differently  from  one  who  seeks  to  minimize 
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Medicare  payments  (e.g.,  one  working  for  Medicare).  We  must,  therefore,  be 
critically  concerned  as  to  whether  there  are  procedures  for  resolving  differences  in 
judgment  and  whether  they  would  hold  up  when  large  amounts  of  money  were  at 
stake. 

DRGs  require  four  judgments  with  subjective  elements:  making  the  diagnoses, 
coding  the  diagnoses,  coding  the  procedures,  and  selecting  the  principal  diagnosis. 

Making  diagnoses  involves  considerable  clinical  judgment  and  some  kinds 
of  judgment  which  are  less  clinical.  lezzoni  and  Moskowitz  (1986)  give 
the  example  of  a  patient  with  chest  pain  who  might  be  coded  as  having 
chest  pain,  angina  pectoris,  or  arteriosclerotic  coronary  disease,  resulting 
in  three  very  different  payments.  Although  procedures  for  resolving 
diagnostic  disagreement  are  better  established  than  procedures  for 
settling  some  other  coding  controversies,  a  subjective  element  remains. 
Demlo  and  Campbell  (1981)  found  that  independent  coders  could  not 
resolve  about  5-11  percent  of  disagreements,  but  the  more  explicit  rules 
of  ICD-9-CM  may  have  reduced  this  ambiguity.  One  source  of 
unresolvable  ambiguity  is  that  the  diagnostician  can  often  choose  between 
a  sign  or  symptom  such  as  chest  pain,  coma,  or  anemia,  and  a  pathological 
diagnosis  such  as  myocardial  infarction,  drug  overdose,  or  colon  cancer 
(lezzoni  and  Moskowitz  1986).  Although  Medicare  rules  defining  principal 
diagnosis  clearly  give  preference  to  listing  the  pathological  diagnosis 
first,  there  is  wide  room  for  choice.  Consistent  "upcoding"  of  marginal 
cases  can  lead  to  higher  case  mix  indices  with  the  same  mix  of  cases 
(Simborg  1981;  lezzoni  and  Moskowitz  1984,  1986,  Gertman  and 
Lowenstein  1984). 
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Coding  is  also  subjective  because  there  is  ambiguity,  even  in  Medicare 
policy,  as  to  how  active  a  secondary  diagnosis  must  be  in  order  to  be 
coded.  A  coding  convention,  which  aggravates  the  problem,  requires  an 
abstractor  to  code  any  diagnosis  which  the  physician  says  must  still  be 
ruled  out  at  the  time  of  discharge. 

Procedure  coding  involves  significant  subjectivity  because  DRGs  require 
procedures  coded  in  ICD-9-CM,  and  this  procedure  coding  system  is  quite 
limited  and  somewhat  obsolete.  Thousands  of  dollars  a  case  can  ride  on  a 
controversy  such  as  whether  percutaneous  transluminal  coronary 
angioplasty  is  to  be  given  the  same  code  as  coronary  angiography  (to 
which  it  is  similar  in  method)  or  as  coronary  artery  bypass  grafting  (to 
which  it  is  similar  in  purpose).  Such  problems  can  be  resolved  definitively 
and  subjectivity  removed  once  they  are  identified  and  a  rule  is  issued,  but 
the  system  remains  subjective  in  the  interim. 

Sequencing  diagnoses  is  also  subjective  and  carries  high  cost  implications 

because   the    first-listed   or    principal    diagnosis    is    the    major    basis    for 

determining   DRG   assignment.     Coding  conventions  clearly  require  that 

the  diagnosis  found  to  be  the  cause  of  the  admission  is  listed  first,  but  the 

conventions  are  generally   less  clear   on   whether  to   list  the  underlying 

disease  or  the  condition   which  caused  the  admission.     For  example,  a 

patient  admitted  with  pneumonia  secondary  to  advanced  cancer  might  be 

fairly  given  a  principal  diagnosis  of  either  pneumonia  or  cancer. 

2.  Reliability.     The  reliability  of  a  case  mix  classification  system  depends  on 

the  accuracy  with  which  the  necessary  data  are  coded  and  abstracted  from   the 

medical  record  and  the  reproducibility  of  classification  from  the  abstracted  data. 

The  reader  should  be  alert  to  the  way  in  which  reliability  is  reported.    If  a  system 
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has  95  percent  type  A  cases  (nonoutliers,  for  example)  and  5  percent  type  B  cases 
(outliers),  two  ratings  could  be  90  percent  reliable  and  never  agree  on  whether  a 
case  was  type  B.  Since  chance  agreement  is  much  less  likely  in  a  system  with  a 
large  number  of  categories,  some  adjustment  is  needed  to  compare  systems  with 
different  numbers  of  categories  (for  example,  PMCs  and  SOU).  The  statistic  which 
corrects  for  this  error  is  called  kappa,  but  it  is  not  commonly  used.  Reliability  may 
also  be  a  more  important  issue  for  systems  with  a  small  number  of  categories  which 
have  very  different  payment  implications  than  systems  which  have  many  categories 
with  very  similar  payment  implications  (see  discussion  of  coding  accuracy  below). 

Computerized  steps  in  classification  tend  to  be  reliable  even  if  there  are 
programming  errors,  although  occasionally  computer  programs  will  give  different 
results  running  on  different  computers.  Experience  indicates  that  programming 
errors  affecting  validity  are  relatively  common,  but  those  affecting  reliability  are 
rare. 

Studies  of  DRG  reliability  (see  Cretin  and  Worthman  1986)  have  focused  on  two 
questions:    the  accuracy  of  coding  and  changes  in  response  to  the  incentives  of. PPS. 

a.  Coding  Accuracy.  Agreement  in  coding  principal  diagnoses  using  the  older 
ICD-8  classification  scheme  is  generally  reported  to  be  between  60  and  70  percent, 
depending  on  whether  agreement  is  at  the  three  digit  or  four-digit  level  (lOM  1977a; 
lOM  1977b;  lOM  1980;  Demlo  and  Campbell  1981).  At  the  level  of  accuracy  needed 
to  assign  a  DRG,  the  agreement  was  just  over  70  percent.  It  is  important,  however, 
to  know  the  impact  of  disagreement  on  CMI.  Using  ICD-9-CM  DRGs,  Johnson  and 
Appel  (1984)  found  disagreement  in  DRG  assignment  in  47  percent  of  the  cases,  but 
the  average  impact  on  the  CMI  was  4  percent,  with  no  more  than  a  2  percent 
difference  in  77  percent  of  hospitals. 

b.  Response  to  Incentives.  Carter  and  Ginsburg  (1986)  (Table  2.2)  found  that 
the  CMI  rose  9.2  percent  from  1981  to  1984,  of  which  they  attributed  1.4  percent  to 
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a  pre-PPS  trend,  0.6  percent  to  PPS  associated  shifts  to  outpatient  care,  and  7.0 
percent  to  coding  changes.  Of  the  7.0  percent,  a  one-time  jump  of  4.0  percent 
resulted  from  shifting  to  a  new  abstracting  form,  which  contained  more  information, 
but  a  further  jump  of  2.8  percent  appears  to  reflect  a  self-limiting  change  in  coding 
practices.  Carter  and  Ginsburg  (1986)  could  not  determine  whether  this  2.8  percent 
change  resulted  from  simply  coding  more  carefully  or  whether  it  reflects  "gaming," 
but  2.8  percent  is  less  than  the  gap  which  Johnson  and  Appel  (1984)  found  between 
actual  coding  and  correct  coding.  There  may  also  be  room  for  further  creep  in  the 
CMI  because,  even  after  the  beginning  of  PPS,  abstracts  coded  by  hospitals  for  an 
abstracting  service  yield  a  3  percent  higher  CMI  than  abstracts  coded  by  the  same 
hospitals  for  HCFA.  Carter  and  Ginsburg  point  out  that  coding  for  Medicare  was 
done  sooner  after  discharge  than  coding  for  the  abstracting  service,  and  some 
Medicare  coding  is  being  done  by  hospital  business  offices  rather  than  medical 
records  personnel;  either  difference  may  explain  the  CMI  difference. 
G.  PREDICTIVE  POWER.   In  examining  predictive  power  we  will  consider: 

—what  is  being  predicted, 

—the  data  used, 

—the  statistical  methods,  and 

—whether  the  prediction  is  for  individual  patients  or  for  hospitals. 
A  system  is  more  credible  if  investigators  other  than  thg^developers  have  carried 
out  studies  and  if  separate  data  sets  are  used  to  create  payment  weights  and  to  test 
predictive  power.  Predictive  tests  carried  out  on  the  same  data  set  used  in 
developing  a  systein  (even  when  that  data  set  is  "representative")  usually  make  a 
system  look  better  than  it  will  look  on  a  new  set  of  cases.  Although  this  type  of  bias 
can  be  reduced  by  the  use  of  adjusted  statistics,  such  adjustments  are  not  always 
made. 
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1.  What  is  Being  Predicted.  Although  we  would  hope  to  evaluate  DRGs  against 
the  gold  standard  of  predicting  necessary  care,  almost  no  evidence  on  this  issue  is 
available.  Investigators  have,  at  best,  studied  the  ability  of  DRGs  to  predict  actual 
costs  and,  in  many  cases,  studies  have  examined  only  ability  to  predict  charges  or 
length  of  stay. 

2.  Data.  The  sample  of  patients  and  hospitals  used  to  test  performance  can 
have  two  effects. 

a.  Exaggerated  Apparent  Performance.  The  performance  of  a  system  in 
classifying  patients  from  just  a  few  hospitals  may  give  a  biased  indication  of  the 
performance  of  the  system  on  a  national  sample  of  hospitals.  This  is  especially  true 
if  all  the  hospitals  in  a  sample  are  from  the  same  region  of  the  country  or  are  about 
the  same  size  or  have  the  same  level  of  teaching  involvement.  For  example,  if  data 
come  from  hospitals  with  relatively  homogeneous  practice  styles  (compared  to  the 
whole  country),  the  system  may  do  much  better  than  it  would  on  a  national  data  set. 
This  may  be  why  Horn  found  that  DRGs  accounted  for  75  percent  of  cost  variation 
in  a  sample  of  15  "mostly  large,  teaching"  hospitals,  in  contrast  to  the  25-40  percent 
of  the  variance  more  typically  reported  (Table  2.5). 

b.  "Volunteer"  Effects.  The  performance  of  a  system  in  a  group  of  "volunteer" 
hospitals  may  not  represent  its  potential  in  randomly  selected  or  mandated 
hospitals.  DRGs  were  developed  on  a  sample  from  th.g  nation's  largest  voluntary 
abstracting  system;  all  other  systems  except  PMCs  were  originally  developed  and 
tested  on  "volunteer"  hospitals  or  (in  the  case  of  APACHE)  on  selected  cases  within 
"volunteer"  hospitals.  Often  these  "volunteers"  have  actually  subscribed  to  the 
system  for  internal  hospital  management.  These  volunteers  are  likely  to  be  an 
unrepresentative  set  of  hospitals,  and  paying  subscribers  are  likely  to  be  much  more 
motivated  to  make  the  system  work  accurately  than  hospitals  seeking  Medicare 
payments. 
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For  these  reasons,  comparisons  between  two  systems  working  on  a  single  data 
set  are  much  more  informative  than  contrasting  figures  derived  from  different  data 
sets.  The  variation  in  predictive  power  of  DRGs  between  States  (Table  2.3)  suggests 
the  risks  of  comparing  performance  of  two  different  systems  on  different  data. 
Because  DRGs  have  been  a  benchmark  for  assessing  performance  of  other  systems, 
some  of  the  results  reported  here  were  developed  incidental  to  research  on  other 
systems. 

We  might  expect  that  classification  systems  would  perform  better  on  systems 
where  care  was  more  uniformly  appropriate.  However,  Kay  et  al.  (1986)  (Chapter 
4.1V.B.1)  found  no  improvement  in  the  performance  of  DRGs  in  a  data  set  from 
which  apparently  inappropriate  days  were  removed  compared  to  the  same  data  set 
before  inappropriate  days  were  removed.  This  suggests  that  rigorously-defined 
appropriateness  of  care  may  not  be  necessary  in  data  sets  for  comparing  systems. 

3.  Statistical  Methods.  The  reader  needs  to  appreciate  three  features  of  the 
statistics  used  to  measure  system  performance: 

--whether  they  measure  overall  predictive  power  or  measure  ability  to  classify 
patients  with  patients  whose  needs  are  similar, 

—  whether  they  are  highly  sensitive  to  unusual  cases,  and 

—  whether  they  are  applied  to  individual  patients  or  to  averages  for  hospitals. 

a.   Total   Predictive   Power   v.    Homogeneity   of    Categories.      The    two    most 

frequently  used  statistics  for  measuring  predictive  power  are 

2 

—  explained  variance  (R  ),  which  measures  the  percent  of  total  variation  in  cost 

which  the  system  predicts,  and 

—  coefficient  of  variation  (CV),  which  measures  the  variation  in  cost  among  a 
group  of  patients  as  a  percentage  of  the  average  cost  for  that  group. 
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2 
In    general,    R      comes   closer    to    our    intuitive    notion   of    what   a    case    mix 

measurement  system  should  do,  but  CVs  are  frequently  necessary  when  examining  a 

part  of  a  case  mix  measurement  system  or  when  the  classification  system  cannot 

classify  all  patients. 

2 
b.  Sensitivity  to  Unusual  Cases.    Both  R     and  CV  are  squared  error  measures. 

This    means    that    variation    from    predicted    cost    (error)    is   squared   before    it    is 

averaged.    This  practice  is  usual  in  statistics,  but  it  means  that  a  few  unusual  cases 

can  dominate  the  results.    For  example,  a  system  which  predicts  case  costs  exactly 

in  99  cases  and  misses  by  $1,001   in  one  case  will  appear  to  perform  worse  than  a 

system  which  misses  by  $100  on  all  100  cases.    The  impact  of  squared  error  methods 

is    clear    in    Table    2.3,    where    removing    small   numbers    of   outliers   dramatically 

improves  the  performance  of  case  mix  systems.     The  reader  should  be  aware  that 

the  number  of  outliers  removed  in  these  "trimmed"  studies  is  substantially  smaller 

than  the  number  which  is  paid  under  the  HCFA  outlier  policy.    Thus,  in  comparing 

methods  the  reader   must  be  alert  to  whether  the  data  have  been  trimmed  in  a 

uniform   way.     The   reader   should  also  be   aware   that  using   untrimmed  data   to 

compare  DRGs  to  an  alternative  requires  that  DRGs  classify  outliers  in  a  way  which 

predicts  cost.    PPS,  because  of  the  outlier  payment  system,  does  not  require  that 

DRGs  classify   outliers   in   a    way    which  predicts  costs.      Thus  comparisons  using 

untrimmed  data  are  not  ideal  for  deciding  how  classification  systems  will  perform  as 

part  of  PPS. 

Another    approach    is    to    measure    average   deviation   or   average    error,    and 

methods  which  do  this  are  often  called  "robust"  because  they  are  less  sensitive  to 

outlier  cases.     This  approach  is  commonly  used  in  economic  impact  studies  (e.g., 

Horn  et  al's.,   (1985b)  projection  of  the  impact  of  using  SOU   to  refine   PPS  and 

Mitchell  et  al.'s,  (1985a)  estimates  of  the  impact  of  paying  physicians  with  DRGs), 


2-31 


and  Cretin  and  Worthman  (1986)  use  robust  statistics  to  assess  the  impact  of  using 
Disease  Staging  to  refine  DRGs  (Chapter  3).  Cretin  et  al.  (1986)  use  another  kind 
of  robust  regression  to  assess  the  relationship  of  hospital  characteristics  to  profit 
under  PPS;  here,  outlier  cases  are  given  less  weight  in  an  otherwise  conventional 
measurement.  While  robust  statistics  reflect  the  way  hospitals  compute  profits  and 
losses,  they  are  not  widely  used  or  understood.  In  economics,  however, 
computations  often  involve  logarithms;  these  calculations  tend  to  be  more  robust 
than  calculations  based  on  raw  numbers. 

c.  Case  Level  v.  Hospital  Level  Analyses.  Because  most  experimental  studies 
are  conducted  in  just  a  few  hospitals,  the  predictive  power  of  the  system  is  usually 
measured  for  individual  patients.  This  approach  gives  more  information  about  the 
ability  of  the  system  to  protect  individual  patients  against  discrimination,  skimming, 
and  dumping.  However,  the  aim  of  PPS  is  to  pay  hospitals  accurately  (Chapter  1), 
and  hospital-level  analyses  are  more  revealing  for  this  purpose  (Chapter  4).  In 
principle,  one  would  expect  the  results  to  be  quite  different,  with  some  kinds  of 
variation  averaging  out  at  the  hospital  level  and  other  kinds  of  variation  being  quite 
important  between  hospitals.  For  example,  Medicaid  eligible  Medicare  beneficiaries 
have  almost  exactly  the  same  charges  as  other  beneficiaries,  but  hospitals  with  a 
high  percentage  of  Medicaid  patients  are  reported  to  have  higher  costs  per  case 
(Gordon  1985,  Altman  1985).  Finally,  patient  level  analysis  is  much  easier  to 
understand  and  assess:  a  clinician  or  even  a  layman  can  look  at  two  patients  and 
make  some  evaluation  of  whether  they  should  have  been  paid  equally.  The  same 
judgment  is  much  more  difficult  when  looking  at  whole  hospitals. 

4.  DRG  Performance  at  the  Case  LeveL  When  viewed  at  the  case  level,  using 
reduction  in  variance  as  the  statistical  criterion,  DRGs  are  clearly  helpful  in 
explaining  variations  in  resource  use.     In  a  variety  of  studies  (Table   2.3)   DRGs 
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explain  from  17  to  30  percent  of  the  variation  in  costs  in  untrimmed  data  and  from 
30  to  48  percent  of  the  variation  in  cost  on  trimmed  data  (Mitchell  et  al.  1984; 
Mitchell  et  al.  1985a).  Performance  in  explaining  charges  is  similar  to  that  for 
costs  (West  et  al.  1985;  Cotterill  et  al.  1985). 

As  we  noted  in  section  III.B.l.a,  surgical  DRGs  consistently  out  perform 
medical  DRGs  (Table  2.4).  At  the  case  level,  surgical  DRGs  explain  48  to  57 
percent  of  the  variance  in  charges  in  trimmed  data,  while  medical  DRGs  only 
explain  9  to  16  percent  of  the  variance  (Mitchell  et  al.,  1984,  1985a). 

5.  DRG  Performance  at  the  Hospital  Level.  The  DRG-based  hospital  CMI 
accounts  for  41  percent  of  variation  in  hospital  average  cost  per  Medicare  case 
(Table  4.2).  The  CMI  combined  with  the  other  variables  used  in  reimbursement  (the 
wage  index,  an  urban/rural  designation  and  the  house  staffA^ed  ratio),  explains  66  to 
69  percent  of  costs  (Table  4.1)  or  charges  (Cotterill  et  al.  1985)  (Table  2.5). 

One  study  of  15  hospitals  reported  that  the  DRG-based  CMI  alone  explained  75 
percent  of  the  variance  in  cost  per  case  (Horn  et  al.  1984b)  (Table  2.5),  but  the 
sample  was  small  and  hospitals  were  more  similar  than  in  national  samples. 
H.  SUMMARY.  This  chapter  has  examined  the  problems  of  predicting  the  cost  of 
necessary  care  for  individual  patients.  Section  I  laid  out  the  predictive  methods 
implicit  in  DRGs,  Section  II  explored  the  ambiguities  in  the  idea  of  necessary  care 
and  the  policy  problems  latent  in  these  ambiguities,  Section  III  explored  the  issues  in 
choosing  variables  to  predict  necessary  care,  and  Section  IV  established  a  framework 
for  comparing  alternative  case  mix  systems  using  the  evaluation  of  DRGs  as  an 
example.  In  the  next  chapter  we  apply  this  framework  to  the  evaluation  of 
alternative  case  mix  systems. 
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Table  2.1 
BASIC  FEATURES  OF  ALTERNATIVE  CASE  MIX  MEASUREMEWT  SYSTEMS 


SYS'l'Eiyi 
NAME 

CREATOR 

ORGAN- 
IZATICN 

ORDINAL/     NUMBER  OF      DATA 
]NP3N[-        GROUPS/SCALE  REQUIRE-    PRO- 
ORDINAL*       POINTS      MhWl'S    PRIEIARY 

Diagnosis- 
Related 
Groups 
(DRGs) 

Thonpson 
Fetter 
Aver ill 

Yale 

Nbnordinal 

468 

UHDDS 

Nb 

?oate       Knaus 
Physiology   Wagner 
and  Chronic 
Health  Evaluation 
(APACHE) 

George 

Washington 

University 

Ordinal 

71 

specially 
abstracted 
physiologic 
c3ata 

No 

Disease 
Staging 
(Staging) 

Gonella 

Thomas 

Jefferson 

University 

Nbnordinal** 
4 

420       UHDDS 
groups 
levels  each) 

Software 
yes; 

architect- 
ure no 

Medical 
Illness 
Severity 
Grouping 
System 
(MEDISGRPS) 

Brewster  MediQual 
Inc. 

Ordinal 

5 

specially 
abstracted 
history  and 
physiologic 
data 

Software 
and  archi 
tecture 
secret . 

Patient 
Management 
Categories 
(FMCs) 

Young 

Blue  Cross  Nbnordinal*** 
of  Western 
Pennsylvania 

720 

UHDDS 

Nb 

Severity 
of  11  Mess 
Index 
(SOU) 

Horn 

Johns 

Hopkins 

University 

Ordinal 

5 

specially 
coded  scales 
"implicitly" 

integrated 

Nb,  but 

training 

manuals 

not 

public 

Conputer- 
ized 

Severity 
Index 
(CSI) 

Horn 

Johns      Nbnordinal** 

Hopkins  & 

Health 

Systems 

International 

5 

specially 
abstracted 
physiologic 
data 

Incom- 
plete 

*ln  an  ordinal  system,  patients  have  greater  severity  or  care  needs  than  patients 

with  lower  numbers  and  less  than  patients  with  higher  numbers. 

**  Stages  are  ordinal  within  each  staged  disease  category.  CSI  levels  are  ordinal 

within  a  diagnosis. 

***  HMCs  are  ordinal  within  major  disease  category. 


TABLE  2.2 

DECOMPOSITION  OF  CMI  INCREASE  ^ 

In  Percentage  Points 

CY  1981-FY  1984  CMI  increase 9.2 

Medical  practice  changes 

Pre-PPS  trend 1.4 

P PS-associated  shifts  to  outpatient 0.6 

Setting  for  lens  procedures 0.7 

Setting  for  chemotherapy , -0.1 

Other  outpatient  substitution 0.1 

Total,  medical  practice  changes 2.0 

Older  patients 0.0 

Coding  practice  changes 

PPS-induced 2.8 

MEDPAR/PATBILL  difference      4.0 

Total,  coding  practice  changes 7.0 

NOTE:   Numbers  are  multiplicative  instead  of  additive. 
For  example,  1.028  *  1.040  =  1.070. 

Source:  Carter  &  Ginsburg  1986 
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CHAPTER  3 
EVALUATION  OF  ALTERNATIVE  CASE  MIX  MEASURES 
This  chapter  uses  the  framework  established  in  the  last  chapter  for  examining 
DRGs  to  evaluate  five  case  mix  adjustment  systems  which  have  been  proposed  as 
replacements  or  refinements  to  DRGs—APACHE,  Staging,  MEDISGRPS,  PMCs,  and 
son.  Section  I  describes  three  possible  uses  for  alternative  systems,  Sections  II-VI 
examine  the  five  alternative  systems,  Section  VII  draws  on  this  and  the  previous 
chapter  to  assess  the  value  of  different  predictors  of  resource  use,  and  Section  VIII 
summarizes  the  usefulness  of  the  systems  for  PPS. 

I.  CRITERIA  FOR  DIFFERENT  SYSTEM  USES 
PPS   needs   case    mix    adjustment    systems    for    research    and    for    refining   or 
replacing  DRGs  as  payment  tools;  hospital  managers  need  such  systems  for  internal 
management.   These  uses  place  different  demands  on  a  system: 

A.  RESEARCH.  PPS  needs  research  instruments  that  can  identify  its  strengths  and 
shortcomings—criterion  systems  which  are  closer  to  the  gold  standard  we  have 
discussed.  In  some  ways,  a  research  system  can  meet  less  stringent  requirements 
than  a  systern  for  payment  because  it  can: 

—  use  data  collection  procedures  which  would  be  too  cumbersome  for  a  payment 
system,  and 

—  use  measures  which  are  subject  to  manipulation  (especially  if, the  research 
data  are  collected  retrospectively  when  the  system  cannot  be  "gamed"). 

Classification  methods  based  on  more  subjective  measures  can  be  used  in  research, 
provided  that  care  is  taken  in  training  and  double  checking  the  coders.  But  we  have 
already  observed  that  research  coders  may  inadvertantly  game  systems  to  improve 
research  results  just  as  coders  in  a  reimbursement  system  may  game  it.  A  research 
system  clearly  must  be  more  valid  or  more  reliable  than  DRGs  to  evaluate  them. 
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B.  REFINING  AND  REPLACING  DRGs.  A  system  for  refining  DRG  payments  need 
not  be  as  precise  as  one  for  research  because  there  is  greater  protection  from  the 
law  of  large  numbers.  A  payment  system  must,  however,  be  even  more  reliable  and 
resistant  to  manipulation  than  a  research  system  and  it  must  produce  appreciable 
improvements  in  the  extent  to  which  similar  cases  are  classified  together.  A 
refinement  system  need  not  be  able  to  classify  all  cases  since  unclassifiable  cases 
can  be  paid  under  existing  DRGs  after  appropriate  recalibration,  but  a  system  to 
replace  DRGs  must  be  able  to  classify  all  cases. 

C.  MANAGEMENT.  We  have  pointed  out  (Chapter  l.IU.A)  that  the  usefulness  of  a 
system  to  hospital  managers  is  not  a  measure  of  its  usefulness  to  PPS.  Hospital 
managers  need  a  system  which  is  more  precise  than  that  used  for  PPS  payment 
because  they  must  assess  performance  of  individual  physicians  and  departments 
while  PPS  need  only  (Chapter  1)  pay  hospitals  accurately.  Further,  hospital 
managers  can  use  a  system  which  is  subjective  and  subject  to  gaming  if  their  control 
of  the  abstracting  process  protects  them  from  being  gamed  by  physicians  and 
departments.  Finally,  managers  may  be  more  interested  in  the  system's  value  in 
quality  assurance,  a  concern  of  greater  interest  to  the  PRO  system  than  to  the 
payment  system.  Thus,  while  we  will  examine  use  of  the  alternative  systems  for 
evaluation,  refinement  and  replacement  of  DRGs  for  payment  purposes,  we  will  not 
focus  on  their  usefulness  in  hospital  management,  and  we  will  not  consider  the  fact 
that  some  systems  have  proven  useful  to  managers  as  evidence  that  they  would  be 
useful  for  PPS. 

II.  ACUTE  PHYSIOLOGY  AND  CHRONIC  HEALTH  EVALUATION  (APACHE) 
Knaus  and  associates  at  George  Washington  University  developed  the  APACHE 
system.     APACHE  was  designed  to  assess  the  probability  of  death  and  severity  of 
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illness  for  Intensive  Care  Unit  (ICU)  patients  (Wagner  and  Draper  1984;  Knaus  et  al. 
1981).  APACHE  has  gone  through  several  rounds  of  development  and  evaluation, 
and  use  in  reimbursement  is  a  recent  proposal  (Wagner  and  Draper,  1984). 

A.  CLASSIFICATION  STRUCTURE.  APACHE  creates  a  score  for  the  whole  patient 
rather  than  classifying  individual  diseases  or  diagnoses,  and  the  score  is  independent 
of  diagnosis.  APACHEII  (the  current  version)  builds  a  total  score  from  three 
components: 

1.  The  Acute  Physiology  Score  (APS12),  based  on  the  values  of  12  physiologic 
variables  (e.g.,  temperature,  pulse,  respiratory  rate,  and  arterial  blood  gases),  which 
is  determined  within  24  hours  of  admission  to  the  Intensive  Care  Unit  (ICU).  Each 
physiologic  variable  contributes  points  to  the  overall  score  according  to  a 
mathematical  formula  reflecting  how  far  the  patient's  value  deviates  from  "normal"; 

2.  Age  points  for  five  categories  of  age;  and 

3.  Chronic  health  evaluation  points  based  on  a  history  of  organic  insufficiency 
according  to  a  list  of  conditions. 

The  overall  score  is  intended  to  be  an  ordinal  scale,  which  means  that  a  patient 
with  a  higher  score  is  more  likely  to  die  than  a  patient  with  a  lower  score,  but  that 
the  difference  is  not  necessarily  proportional  to  the  difference  in  score.  APACHEII 
values  can  range  from  0  to  a  theoretical  maximum  of  71;  55  is  the  highest  value  yet 
observed.  The  APS12  component  can  contribute  60  of  the  71  points  (Knaus  et  at. 
1985,  Wagner  and  Draper  1984). 

B.  VARIANTS.       The    original    APACHE,    with    34    variables    in    its    physiologic 
component,  has  been  replaced  by  the  12  variable  APS  score.    The  developers  have 
suggested  that  a  more  parsimonious  set  of  physiologic  predictors  could  be  developed^ 
for  reimbursement  (Wagner  and  Draper  1984). 

C.  DATA  REQUIREMENTS.     APACHE  requires  physiologic  and  test  data  from  the 
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first  24  hours  of  admission  to  the  ICU.  Because  these  data  elements  are  not 
routinely  abstracted  in  most  hospitals,  customized  abstraction  forms  are  used  to 
collect  the  data.  Abstracting  costs  would  be  lower  if  some  of  the  APS12  elements 
could  be  obtained  directly  from  an  automated  laboratory  reporting  system  or 
automated  hospital  record  system.  In  the  long  run,  as  hospitals  put  more  and  more 
patient  information  on  centrally  accessible  computer  systems,  a  system  like 
APACHE  (or  MEDISGRPS)  might  be  redesigned  to  eliminate  the  need  for  special 
abstraction  forms.  Such  technological  changes  could  dramatically  reduce  costs.  At 
the  present,  however,  APACHE,  like  other  systems  using  physiologic  data,  would 
create  a  significant  abstracting  and  reporting  burden. 

D.  FACE  VALIDITY.  Because  APACHE  was  designed  to  assess  the  likelihood  of 
death  for  ICU  patients  (Knaus  et  al.  1985),  its  origins  are  more  explicitly  clinical 
than  those  of  any  other  case  mix  system,  but  this  clinical  origin  may  be  less  useful 
when  assessing  patients  suffering  from  conditions  with  low  mortality  rates. 
APACHE'S  focus  on  critically  ill  patients  explains  the  emphasis  on  physiologic 
measurements.  The  first  version  of  APACHE  was  developed  by  clinicians  who 
identified  important  variables  and  assigned  relative  weights  to  abnormal  values. 
The  initial  physiologic  score  relied  on  34  variables  (Knaus  et  al.  1981).  Subsequent 
analyses,  using  mortality  as  the  predicted  variable,  reduced  the  number  to  12  in 
APACHEII  (Knaus  et  al.  1985). 

Using  APACHE  to  predict  inpatient  costs  appears  reasonable  because  of  both 
the  proportion  of  hospital  costs  devoted  to  critical  care  and  the  severity  hypothesis 
discussed  in  the  last  chapter,  but  there  are  three  problems. 

1.  Critical  and  Noncritical  Care.  The  system  was  developed  and  tested  on 
critical  care  cases,  and  APACHE's  performance  on  non-ICU  cases  is  unknown  (the 
only  study  of  this  issue  is  Coulton  et  al.  1985).    The  relation  of  risk  of  death  to 
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resource  consumption  may  be  very  different  in  non-ICU  patients. 

2.  Effects  of  Death.  The  confounding  effects  of  mortality  on  resource  use 
(Chapter  2.II.D)  are  an  especially  significant  problem  for  a  system  designed  to 
predict  mortality.  Wagner  and  Draper  (1984)  have  shown  that  resource  use  actually 
declines  in  patients  with  the  very  highest  APACHEII  scores. 

.  3.  Goals  of  Treatment.  In  the  group  of  patients  to  whom  APACHE  is  especially 
directed,  the  clinical  decision  of  whether  to  make  the  patient  comfortable  or  to  try 
to  preserve  life  is  critical  to  resource  use  (Chapter  2.I1.C.3);  the  system  cannot 
include  this  variable. 

In  sum,  further  research  is  necessary  before  we  can  be  confident  that  APACHE 
will  predict  costs  of  care  for  the  general  patient  population. 

E.  AVAILABILITY  OF  DOCUMENTATION.  The  APACHE  system  is  reasonably  well 
documented  (Knaus  et  al.  1981,  1985),  and  has  been  implemented  in  many  ICU 
settings  (Wagner  et  al.  1984).  The  APS12  component  of  the  score  is  particularly 
clear. 

F.  SUBJECTIVITY  AND  RELIABILITY.  The  variables  used  in  the  physiologic  and 
age  components  of  APACHE  are  quite  objective,  but  the  chronic  health  evaluation 
points  require  some  judgment.  Knaus  et  al.  (1985)  report  96  percent  agreement  on 
the  12  physiologic  measures  on  repeat  abstraction  (Table  3.1),  but  "somewhat  less" 
agreement  on  the  Chronic  Health  Index;  they  did  not  report  overall  agreement 
between  APACHE  scores  based  on  the  two  abstractions. 

G.  PREDICTIVE  PERFORMANCE.  APACHEII  has  been  applied  to  ICU  cases  at  13 
hospitals,  mostly  university  teaching  hospitals  (Knaus  et  al.,  1985;  Wagner  and 
Draper  1984).  The  original  APACHE  system  has  been  applied  more  broadly  (Knaus 
et  al.  1981,  1984),  including  a  recently  reported  international  study  involving  14 
hospitals     from     the     US,     France,    Spain     and    Finland    (Wagner    et    al.    1984). 
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While  the  analysis  of  APACHE  data  has  stayed  in  the  hands  of  the  developers,  data 
collection  has  been  carried  out  without  their  supervision  (Knaus  et  al.  1985). 

1.  Performance  at  the  Case  Level.  The  main  evidence  on  APACHE's  predictive 
peformance  is  its  ability  to  predict  mortality  (Table  3.2).  As  a  predictor  of  death, 
the  APACHEI  score  predicts  97  percent  of  deaths  and  49  percent  of  those  predicted 
to  die  do  so  (Knaus  et  al.  1981).  In  a  study  of  5030  patients  in  13  ICUs,  components 
of  APACHEIl,  admitting  diagnosis,  and  surgical  status  account  for  32  percent  of  the 
variance  in  mortality,  and  85  percent  of  predictions  as  to  mortality  are  correct 
(Knaus  et  al.  1985). 

Using  the  Therapeutic  Intervention  Scoring  System  (TISS)  as  a  proxy  for 
resource  use,  APACHEIl  with  surgical  status  explained  17  percent  of  the  variance  in 
resource  use  for  5,790  patients  in  13  ICUs  (Wagner  and  Draper  1984).  Only  ICU 
cases  were  included  and  the  patients  were  not  limited  to  Medicare  cases. 

There  are  no  studies  of  APACHE  limited  to  Medicare  patients  or  in  non-ICU 
patients.  Coulton  et  al.  (1985)  reported  very  uneven  results  when  they  applied 
APS  12  in  an  ICU/non-ICU  population;  in  one  DRG  the  correlation  between  the 
APS12  score  and  resource  use  was  negative. 

2.  Performance  at  the  Hospital  Level.  Adding  variables  identifying  different 
hospitals  explained  an  additional  5  percent  of  the  variance  in  TISS  score  in  Wagner 
and  Draper's  study  (1984),  but  they  did  not  measure  how  much  adding  APACHE 
reduced  variance  in  TISS  score  per  admission  across  hospitals.  Knaus  et  al.  (1986) 
have  used  APACHE  as  an  adjustor  for  studies  of  mortality  across  hospitals,  but  they 
did  not  test  the  ability  of  APACHE  to  account  for  interhospita!  variation  in 
intensive    care    mortality    rates.       Their    data    did    show,    however,    that    observed 
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hospital  mortality  rates  that  were  adjusted  using  APACHE  varied  much  less  than 
unadjusted  mortality  rates. 

III.  DISEASE  STAGING  (STAGING) 
Gonnella  and  associates  at  Thomas  Jefferson  University  developed  the  Disease 
Staging  system;  further  refinements  have  been  carried  out  by  Conklin  and  associates 
at  Systemetrics,  Inc.    Disease  Staging,  like  APACHE,  originated  as  a  quality  control 
system  rather  than  a  management  or  reimbursement  system. 

A.  CLASSIFICATION  STRUCTURE.  Staging  classifies  diseases  rather  than  patients 
and  generally  classifies  them  according  to  progression  of  an  underlying  disease 
rather  than  severity  of  acute  presenting  symptoms.    Staging  works  in  two  steps: 

—  Each  diagnosis  is  assigned  to  one  of  420  Staged  Disease  Categories  (SDC) 
according  to  its  origins  and  pathology.  SDCs  are  nonordinal;  the  category 
number  says  nothing  about  expected  resource  use. 

—  Within  the  SDC,  the  condition  is  then  assigned  to  one  of  four  stages  of 
disease  progression;  stage  four  is  always  death.  Stages  within  SDC  are 
ordinal  in  that  higher  stages  are  always  more  clinically  advanced. 

If  Staging  is  used  for  payment,  a  third  step  is  necessary  to  classify  patients  who 
have  more  than  one  SDC. 

The  staging  concept  developed  in  clinical  oncology,  where  stages  represent 
progressive  degrees  of  cancer  spread  and  stage  generally  correlates  with  morbidity 
and  mortality.  Disease  Staging  generally  preserves  this  meaning  (Gonnella  et  al. 
198^^).    The  stages  are: 

—Stage  I     -  conditions  with  no  complications, 

— Stage  II    -  problems  limited  to  an  organ  or  system, 

—Stage  III  -  multiple  site  involvement, 
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—stage  IV    -  death. 

For  each  disease,  criteria  for  each  stage  are  specified. 

Because  Staging  criteria  are  disease  specific,  Stages  are  not  directly 
comparable  across  diseases.  Stage  2  hearing  loss  is  very  different  from  Stage  2 
breast  cancer  (Gonnella  et  al.  1984;  Conklin  et  al.  1984b). 

Staging  software  stages  all  diagnoses.  The  system  then  offers  two  choices  of 
the  principal  staged  disease — either  the  staged  principal  diagnosis  or  the  underlying 
staged  disease;  the  latter  is  the  highest  staged  SDC  related  to  the  principal 
diagnosis  (based  either  on  the  principal  diagnosis  itself  or  on  any  secondary  diagnosis 
related  to  the  principal  diagnosis).  Information  concerning  staged  conditions 
unrelated  to  the  underlying  staged  disease  may  or  may  not  be  retained  for  further 
study  (Louis  et  al.  1983,  Conklin  1985). 

In  Staging,  substages  are  often  specified  as  well,  reflecting  clinical  differences 

in  progression.    Substages  generally  follow  the  rule  that  higher  numbers  represent 

greater  disease  progression  and  a  higher  risk  of  morbidity  and  mortality  (Gonnella 

el.  1983),   Some  substages,  however,  are  not  ordinal;  they  simply  specify  the  locus  of 

the  disease  more  precisely. 

B.  VARIANTS.     The  two  major  variants  are  manual  and  computerized  Staging.    In 

addition,  within  the  computerized  version,  there  have  been  software  modifications 

to  embody  alternative   methods  for  handling  principal  diagnosis  Staging,  primary 

disease  Staging,  and  the  Staging  of  unrelated  secondary  diseases  (Louis  et  al.  1983; 

Conklin  et  al.  1984a,b).    Some  ICD-9-CM  codes  had  no  corresponding  SDC  and  could 

not  be  staged  by  earlier  versions,  but  recent   modifications  of  the  system   made 

possible   staging  of  all  diagnoses.      These   modifications  also  removed  an  earlier 

feature  in  which  cancers  had  five  stages. 

The  original  manual  Staging  could  be  used  to  assign  Stage  at  three  points  in  a 
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hospital  stay:  on  admission,  "peak"  Stage,  and  at  discharge.  The  computerized 
version  generates  a  single  assignment  of  Stage,  which  is  probably  closest  to  the 
"peak"  stage  (Arthur  Young  Co.  1983). 

C.  DATA  REQUIREMENTS.  The  manual  and  computerized  versions  of  Staging  have 
different  data  requirements.  The  manual  format  provides  detailed  criteria  for 
assigning  stage  and  substage  based  on  a  review  of  the  entire  medical  record.  Manual 
Staging  requires  special  training  for  coders. 

The  computerized  version  requires  only  UHDDS  data  to  assign  stages,  although 
it  can  accept  additional  diagnoses  and  procedures. 

D.  FACE  VALIDITY.  Staging  was  developed  with  diverse  goals,  including  utilization 
review,  patient  care  evaluation,  hospital  management,  and  prediction  of  morbidity 
and  mortality.  The  genesis  of  the  system  was  entirely  clinical,  with  panels  of 
physicians  developing  both  the  definitions  of  the  420  Staged  Diseases  and  the 
criteria  for  assigning  Stage  and  Substage  within  each  disease  group  (Gonnella  et  al. 
1984).  The  diverse  goals  of  Staging  and  the  absence  of  resource  data  during  its 
creation  do  not  give  Staging  high  face  validity  in  predicting  resource  use.  The 
Staging  system  ought  to  predict  resource  use  when  patients  with  more  advanced 
disease  require  more  resources,  but  sometimes  the  presence  of  acute,  treatable 
clinical  problems  may  predict  need  for  care  better  than  does  the  degree  of  disease 
progression.  As  with  other  case  mix  systems,  conditions  resulting  in  death  will 
sometimes  consume  less  resources  than  those  with  better  outcomes;  Stage  4  patients 
sometimes  have  lower  costs  than  patients  in  Stage  3  (Conklin  et  al.  1984a;  Conklin 
1985). 

The   decision   to    minimize  use  of  procedure  codes  is  both  a  strength  and  a 

weakness.    The  strength  is  that  the  categories  depend  more  on  what  is  wrong  with 

the  patient  and  less  on  what  is  done,  while  the  weakness  is  that  a  system  of  this  kind 
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seems  likely  (Chapter  2.II.A.1)  to  penalize  referral  surgical  centers  (Chapter  4. Ill 
reviews  evidence  that  this  would  indeed  happen).  We  will  encounter  this  problem 
again  with  PMCs. 

E.  AVAILABILITY  OF  DOCUMENTATION.  -  Staging  computer  software  and 
documentation  are  available  for  purchase.  A  complete  set  of  manual  Staging 
criteria  are  also  available.  Several  researchers,  besides  the  original  developers  of 
Disease  Staging,  have  used  either  the  manual  (Arthur  Young  Co.  1983)  or 
computerized  versions  (Coffey  and  Goldfarb  1984;  Calore  1985). 

F.  SUBJECTIVITY  AND  RELIABILITY.  Staging  is  vulnerable  to  the  same 
subjectivity  in  coding  diagnoses  as  is  the  DRG  system  (Chapter  2.IV.E).  Although 
the  system  uses  secondary  diagnoses  in  a  more  precise  way,  incentives  are  still 
present  to  code  additional  secondaries  and  to  "upgrade"  coding  in  ambiguous 
situations.   There  are  no  studies  documenting  this  risk. 

Manual  staging  is  more  vulnerable  to  manipulation  than  the  computerized 
version,  because  coders  require  special  training  and  must  review  an  entire  chart. 
Preliminary  evidence  suggests  that  manual  Staging  is  "moderately  reliable"  under 
research  conditions  (Richards  et  al.    1986). 

Computerized  Staging  uses  the  same  UHDDS  data  as  does  the  DRG  system,  so 
inaccuracies  of  diagnosis  coding  will  affect  Staging;  since  Staging  makes  minimal 
use  of  procedures,  inaccuracy  in  procedure  coding  is  a  smaller  problem.  We  are 
unaware  of  studies  of  the  impact  of  coding  errors  on  a  Staging  CMI.  When 
underlying  staged  disease  is  used,  Staging  is  probably  less  vulnerable  to  manipulation 
of  sequencing  but  more  vulnerable  to  addition  of  marginal  secondary  diagnoses. 

Information  on  the  reliability  of  computerized  Staging  has  not  been  reported, 

but  Louis  et  al.  (1984)  have  reported  78-80  percent  agreement  between  manual  and 

computerized  Staging  in  assigning  Stage  (Table  3.1),  depending  on  how  one  handles 
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cases  with  incorrect  prinicpal  diagnosis. 

G.  PREDICTIVE  PERFORMANCE.  Data  on  predictive  performance  are  limited  to 
computerized  staging.  Several  large  data  sets  have  been  used  by  the  developers  of 
Disease  Staging,  including  the  National  Center  for  Health  Services  Research  Health 
Care  Utilization  Project  (HCUP)  file  for  1977  (with  data  from  373  short-term  non- 
Federal  hospitals)  (Gonnella  et  al.  1984).  Other  investigators  have  used  Staging  on 
large  data  sets:  all  Maryland  Medicare  discharges  for  1979  through  1981  (Coffey  and 
Goldfarb  1984;  Coffey  1985),  the  HCUP  file  (Short  and  Coffey  1984)  and  a  sample  of 
500,000  discharges  from  24  teaching  hospitals  (Butler  and  Bentley  1982). 

Medicare  discharges  from  54  Maryland  hospitals  for  1981  and  from  80  New 
Jersey  hospitals  from  1979  have  also  been  used  in  developing  and  testing  a  proposed 
modification  of  17  DRGs  using  a  Staging-based  classification  scheme  (Conklin  et  al. 
1984a,  1984b;  Conklin  1985). 

1.  Performance  at  the  Case  Level.  In  both  the  Maryland  Medicare  data  (Coffey 
and  Goldfarb  1984)  and  in  the  Michigan  data  (Calore  1985),  Staging  performed  less 
well  overall  than  DRGs  in  explaining  cost  per  case  or  length  of  stay  (Table  3.3). 

SysteMetrics  developed  alternative  groupings  which  incorporated  Stage  and 
UHDDS  variables  (Conklin  et  al.  1984a;  Conklin  et  al.  1984b;  Conklin  1985)  for  17 
DRGs  in  10  adjacent  DRG  clusters.  (DRGs  in  an  adjacent  DRG  cluster  differ  only  in 
a  single  variable  used  to  divide  the  DRG  pair.)  The  alternative  groups  outperformed 
DRGs  for  cases  in  the  original  17  DRGs  (Table  3.4).  These  results  suggest  that 
Staging  may  be  useful  in  refining  highly  variable  DRGs. 

2.  Performance  at  the  Hospital  Level.     All  reported  studies  on  reduction  of 

variance  and  homogeneity  have  been  at  the  case,  rather  than  the  hospital,  leveL 

However,  Coffey  and  Goldfarb  (1984)  and  Coffey  et  ah  (1985)  have  examined  the 

payment  implications  of  using  Staging  rather  than  DRGs  (see  Chapter  4. IV).    They 
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found  that,  at  the  hospital  level,  Staging  would  result  in  appreciable  changes  in 
payments  compared  to  DRGs— apparently  as  a  result  of  surgical  cases.  They  did  not, 
however,  compare  payments  to  actual  costs. 

IV.  MEDICAL  ILLNESS  SEVERITY  GROUPING  SYSTEM  (MEDISGRPS) 
Alan  Brewster  and  associates  at  St  Vincent's  Hospital,  Worcester 
Massachusetts,  developed  MEDISGRPS,  and  Brewster  is  pursuing  its  development  at 
MediQual,  Inc.  MEDISGRPS  was  designed  for  hospital  quality  control  (Brewster  et 
al.  1985b).  MEDISGRPS  is  the  newest  and  least  tested  of  the  systems  described 
here. 

A.  CLASSIFICATION  STRUCTURE.  MEDISGRPS  uses  special  abstracting  forms  to 
determine  "key  clinical  findings"  and  a  computerized  algorithm  to  assign  a  patient 
to  one  of  five  ordinal  severity  levels  based  on  the  key  findings.  The  levels  are 
labeled  0  through  4,  with  higher  numbers  representing  a  higher  probability  of  failure 
of  one  or  more  organ  systems.  The  MEDISGRPS  level,  like  that  of  APACHE  and 
SOU,  applies  to  the  patient,  rather  than  to  the  disease,  and  can  be  used  to  compare 
patients  across  diagnostic  categories  (Brewster  et  al.  1985b). 

The  abstraction  form  is  designed  to  accommodate  information  on  up  to  500 
different  findings,  including  physical  examination  findings,  laboratory,  x-ray,  and 
pathology  results,  and  transfusions  (Brewster  et  al.  1984a).  The  system  generates  a 
score  based  on  the  data  which  are  available  and  does  not  require  special  tests  or 
examinations  of  the  patient. 

MEDISGRPS  is  designed  to  use  two  reviews,  one  "on  admission"  (actually  48 
hours  after  admission  to  allow  relevant  information  to  reach  the  patient's  chart), 
and  the  second  on  the  sixth  post-operative  day  or  the  eighth  day  after  admission. 
Data  collected  at  the  second  review  are  also  used  to  assess  intrahospital  morbidity 
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or  discharge  status  for  those  who  have  already  been  discharged  (Brewster  et  al. 
1985a).  For  patients  with  lengthy  hospitalizations  a  third  review  may  be  carried  out 
(Mediqual  Systems,  December  198^). 

B.  VARIANTS.  MEDISGRPS  has  been  revised  a  number  of  times,  but  differences 
between  the  versions  have. not  been  documented  for  the  public.    Subscribers  receive 

.  the  most  recent  version  of  the  system,  and  analytic  files  have  been  updated  to 
reflect  this  version.  Apparently,  changes  involve  the  way  computer  software 
integrates  data  to  produce  a  MEDISGRPS  score  rather  than  the  raw  data  necessary 
to  produce  the  score. 

C.  DATA  REQUIREMENTS.  The  full  version  of  MEDISGRPS  requires  two  reviews  of 
each  patient's  medical  record  (Brewster  et  al.  1985a;  Brewster  et  al.  1985b),  entry 
of  data  onto  an  abstraction  form,  and  transfer  of  data  from  the  form  to  a  computer. 
Brewster  et  al.  (1985a)  estimate  that  abstraction  and  data  entry  require  about  12 
minutes  per  case.  The  cost  of  a  single  review  for  reimbursement  has  not  been 
evaluated. 

System  costs  include  training  abstracters,  abstracting  charts,  and  data  entry- for 
the  computer.  Given  the  large  amount  of  input,  the  computational  costs  of  this 
system  may  be  higher  than  the  computational  costs  of  some  of  UHDDS-based 
systems  such  as  DRGs  or  Staging. 

D.  FACE  VALIDITY.     MEDISGRPS,  like  all  the  otlTer  case  mix  methods  reviewed 

here,  used  expert  judgment  in  conceptualizing  the  system  and  defining  the  criteria 

for  each  classification  level.     Defining  levels  in  terms  of  increasing  probability  of 

organ    failure    is   appreciably   different   from   describing  either   degree   of    disease 

progression  or  acute  physical  findings  which  predict  the  likelihood  of  death.     The 

MEDISGRPS  model  avoids  both  the  difficulties  in  using  likelihood  of  death,  which  we 

reviewed  regarding  APACHE,  and  the  problems  with  disease  progression,  which  we 
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reviewed  regarding  Staging.  Although  the  comparability  of  levels  across  different 
kinds  of  disease  is  uncertain,  this  would  not  be  critical  if  MEDISGRPS  were  used  as 
refinement  for  DRGs  or  another  disease-oriented  system. 

E.  AVAILABILITY  OF  DOCUMENTATION.  MEDISGRPS  documentation  is  available 
only  to  individuals  who  sign  an  agreement  not  to  reproduce  or  disseminate  it.  Full 
descriptions  of  the  computer  algorithm  are  not  available  even  to  subscribers  to  the 
system.  This  laci<  of  documentation  makes  detailed  assessment  of  the  system 
impossible  for  outsiders.  These  difficulties  may  lessen  when  the  system's 
proprietors  are  more  certain  how  much  they  can  reveal  without  compromising  their 
commercial  position. 

F.  SUBJECTIVITY,  VULNERABILITY  TO  MANIPULATION,  AND  RELIABILITY. 
Two  features  of  MEDISGRPS  may  make  it  vulnerable  to  manipulation: 

The  MEDISGRPS  algorithm  uses  the  most  abnormal  value  when  multiple 
values  of  a  test  are  available  (Brewster  et  al.  1985b),  which  may 
somewhat  raise  a  patient's  classification  based  solely  on  the  number  of 
tests  done. 

The  system  uses  some  data  which  are  relatively  subjective,  such  as 
physical  findings  and  radiology  interpretations.  Such  data  may  be  stable 
in  the  absence  of  reimbursement  incentives,  but  we  have  no  data  on 
whether  payment  incentives  will  change  reports. 

G.  RELIABILITY.     Information  on   the  reliability  of  the  MEDISGRPS  abstraction 

process  is  sketchy.    One  report  (Brewster  et  al.  1985b)  notes  that  only  1  percent  of 

the    trainees    failed    to    meet    95    percent   accuracy    standards   after    9    weeks   of 

experience  (Table  3.1).    Accuracy  in  this  case  was  measured  by  comparing  hospital 

trainee  abstraction  forms  to  those  of  an  expert  coder.    As  noted  in  the  last  chapter, 

reliability  figures  with  small  numbers  of  categories  should  be  adjusted  for  chance 
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agreement,  which  has  not  been  done.  Data  on  reliability  of  repeat  ratings  by  the 
same  rater  are  not  available. 

H.  PREDICTIVE  PERFORMANCE.  MEDISGRPS  was  developed  on  a  data  set  from 
one  hospital  (Brewster  et  al.  1985b).  One  subsequent  report  has  included  limited 
data  from  two  additional  hospitals  (Brewster  et  al.  198^c). 

1.  Performance  at  the  Case  Level.  MEDISGRPS  has  primarily  been  tested  as  a 
predictor  of  mortality  (Table  3.5),  morbidity  (Table  3.6),  and  charges  in  selected 
conditions:  abdominal  pain,  chest  pain,  shortness  of  breath  and  extremity  pain. 
(Table  3.7  shows  predictive  power  based  on  the  first  review  and  Table  3.8  shows 
predictive  power  based  on  the  second  review.)  Total  charges  for  different  severity 
groups  are  significantly  different. 

Unpublished  work  by  Moskowitz  and  Ash  at  Boston  University  using  data  from 
hospitals  subscribing  to  the  MEDISGRPS  system,  indicates  that  DRGs  alone  explain 
about  3^  percent  of  the  variance  in  charges  while  DRGs  plus  MEDISGRPS  explain 
about  39  percent  of  the  variance.  MEDISGRPS  has  not  yet  been  tested  on  a  random 
or  nationally  representative  sample  of  hospitals. 

b.  Performance  at  the  Hospital  Level.  Reports  on  hospital  level  performance 
are  not  available. 

V.  PATIENT  MANAGEMENT  CATEGORIES  (PMCs) 

Wanda  Young  developed  the  system   at  Blue  Cross  of   Western   Pennsylvania 

specifically  for  cost  analysis  and  hospital  reimbursement  (Young  et  al.    1982).   PMCs 

are  a  nonordinal  classification  system  accompanied  by  a  normative  but  optional  set 

of  "resource  weights." 

A.  CLASSIFICATION  STRUCTURE.     The  PMC  system  is  a  clinically  based  system 

that  assigns   patients  to  one  or   more  of   800   PMCs  (Young    198^).      PMC's   were 
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designed  to  create  patient  groups  that  are  similar  both  in  expected  total  resource 
use  and  in  expected  diagnostic  and  treatment  services. 

PMCs  are  grouped  into  47  modules,  each  representing  a  disease  or  disorder. 
The  first  two  digits  of  the  PMC  number  refer  to  the  module  and  are  nonordinal.  The 
second  two  digits  identify  a  particular  PMC  within  the  module.  The  second  two 
digits  do  not  reflect  a  hierarchy,  but  the  classification  program  is  hierarchical  in 
that  the  PMC  to  which  a  patient  is  assigned  within  a  module  is  the  PMC  within  a 
module  which  has  the  highest  expected  resource  consumption  of  all  the  PMCs  for 
which  the  patient  may  qualify  (Young,  1985).  The  PMC  classification  software  can 
assign  a  patient  to  several  PMCs  if  multiple  diagnoses  lead  to  several  modules. 
PMC  assignments  do  not  depend  on  the  order  in  which  diagnoses  are  listed,  but  the 
PMC  associated  with  the  principal  diagnosis  may  be  given  priority  for  analytic  or 
payment  purposes.  Procedures  are  used  to  make  about  36  percent  of  PMC 
assignments;  age  and  sex  are  used  in  a  few  instances. 

The  PMC  system  also  specifies  "components  of  care"  for  each  PMC,  a  set  of 
diagnostic  and  treatment  services  needed  to  provide  effective  care  for  an  "average" 
patient.  These  components  of  care  were  constructed  by  about  50  panels  of 
physicians  and  a  normative  "cost  weight"  is  available  for  each  PMC  (Young  et  al., 
1985).  PMCs  can  also  be  used,  like  DRGs,  with  empirically-determined  cost  weights 
(e.g.,  Calore  1985).  The  method  proposed  by  Young  for  handling  cases  with  multiple 
PMC  assignments  combines  the  components  of  care  for  all  assignments,  deletes 
components  of  care  which  have  been  duplicated,  and  determines  a  cost  weight  from 
the  remaining  list  of  components. 

B.   VARIANTS.      An  early  version  of  PMCs,   which  used  reason   for  admission  in 
addition  to  the  usual  UHDDS  data,  has  been  abandoned. 
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C.  DATA  REQUIREMENTS.  PMCs  operate  on  UHDDS  data.  If  the  alternative 
version  is  used,  reason  for  admission  must  be  hand  abstracted. 

D.  FACE  VALIDITY.  The  PMC  system  comprises  a  classification  system  and  the 
normative  "cost  weights"  based  on  components  of  care.  The  classification  system 
could  be  valid  even  if  the  cost  weights  bore  no  relation  to  true  resource  needs,  but 
the  cost  weights  have  no  meaning  outside  the  context  of  the  classification  system. 

The  PMC  classification  system  was  specifically  developed  to  group  patients 
according  to  their  need  for,  rather  than  their  use  of,  diagnostic  and  therapetic 
services.  This  clearly  distinguishes  PMCs  from  other  systems  reviewed  here,  which 
were  designed  to  describe  clinical  severity,  length  of  stay,  or  empirical  costs. 
Against  this  more  credible  origin  must  be  set  the  decision  of  developers  of  PMCs  not 
to  provide  actual  length  of  stay  and  resource  data  to  their  consulting  clinicians  as 
they  were  constructing  the  system. 

The  validity  of  the  cost  weights  depends  on  the  degree  to  which  expert  panels 
were  able  to  reliably  identify  needed  components  of  care  and  the  appropriateness  of 
the  methods  used  to  convert  the  specified  components  of  care  to  costs.  Since 
PMCs  are  the  first  comprehensive  effort  to  assign  normative  costs  to  patient 
groups,  we  cannot  estimate  the  degree  to  which  different  expert  panels  might  arrive 
at  similar  estimates.  As  with  construction  of  the  classification  algorithm,  the  fact 
that  the  panels  worked  without  actual  cost  data  means  that  they  were  not 
influenced  by  prevalent  patterns  of  care.  If  prevalent  patterns  were  appropriate, 
this  decision  probably  put  the  developers  al  a  disadvantage,  but  if  prevalent  patterns 
were  inappropriate  the  decision  would  make  PMCs  more  credible. 

E.  AVAILABILITY  OF  DOCUMENTATION.    Because  the  PMC  system  was  developed 

under    a    research    grant    and    because    the    system     was    intended    for    use    in 

reimbursement,    both    the   computer    program    and   descriptive   documentation  are 
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available  (Young  et  al.  1985).  This  documentation  appears  to  be  adequate  for 
implementing  PMCs.  However,  the  source  code  for  the  algorithm  is  not  available, 
and  the  program  itself  is  not  fully  documented,  so  that  an  investigator  cannot 
determine  how  most  PMC  assignments  are  actually  made.  This  documentation 
deficiency  poses  significant  problems  for  research  and  nonstandard  uses  of  the 
system. 

F.  SUBJECTIVITY  AND  RELIABILITY.  The  reliability  of  PMCs,  like  that  of  other 
UHDDS-based  systems,  depends  on  the  accuracy  of  the  discharge  data  abstract.  As 
with  DRGs  and  computerized  Staging,  PMC's  are  vulnerable  to  upgrading  of 
diagnosis  codes;  no  estimate  of  the  magnitude  of  this  vulnerability  is  available.  The 
fact  that  the  system  does  not  depend  on  the  order  of  diagnoses  decreases  its 
vulnerability  to  manipulation  of  the  sequence  of  diagnoses.  On  the  other  hand,  this 
same  feature  may  mean  that  the  system  is  more  influenced  by  listing  marginal 
secondary  diagnoses.  Data  on  this  issue  are  not  available.  Reabstracting  reliability 
studies  of  the  validity  of  PMCs  are  not  available. 

G.  PREDICTIVE  PERFORMANCE.  PMCs  have  been  applied  to  all  discharges  from 
90  western  Pennsylvania  hospitals  for  1981  through  1983  and  to  all  discharges  from 
55  Maryland  hospitals  in  1983.  PMCs  have  now  been  used  both  on  the 
developmental  data  set  (Young  1984,  1985)  and  on  all  Medicare  cases  from  Michigan 
from  1982  (Calore  1985). 

1.  Performance  at  the  Case  LeveL     Young  has  tested  the  ability  of  PMCs  to 

predict  both    mortality   and   length  of  stay   (Table    3.9).      The   developers   report 

statistics  on  the  stability  of  death  rates  in  selected  PMCs  over  time  (Young  1985)  as 

evidence  of  the  validity  of  the  system.    The  primary  performance  analysis  examined 

the   homogeneity    of   length   of   stay    within    PMCs.      The  average  coefficient  of 

variation  was  0.59  in  the  trimmed  data  set  and  0.79  in  the  untrimmed  data.    This  is 
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lower  than  the  0.94  for  DRGs  in  untrimmed  data  sets  reported  by  Coffey  and 
Goldfarb  (1984)  and  the  0.88  to  0.93  reported  by  Coffey  and  Goldfarb  (1984)  for 
Staging,  but  it  is  roughly  comparable  to  the  0.82  mean  coefficient  of  variation  for 
DRGs  found  in  untrimmed  Medicare  data  (Cotterill  et  al.,  1985). 

Galore  (1985)  found  PMCs  slightly  weaker  than  DRGs  in  explaining  Medicare 
Part  A  (hospital)  costs  in  1982  Michigan  data.  PMCs  explained  15  percent  of  the 
variance  in  cost  per  case  in  the  untrimmed  data  set  and  26  percent  in  the  trimmed 
data  set;  DRGs  accounted  for  17  and  30  percent  respectively.  Galore  did  not  use 
the  normative  PMC  weights  in  this  analysis. 

2.  Performance  at  the  Hospital  Level.  There  are  no  studies  of  the  performance 
of  PMCs  at  the  hospital  level. 

VI.  SEVERITY  OF  ILLNESS  INDEX  (SOU)  and  COMPUTERIZED  SEVERITY  INDEX 
(CSI) 

Susan  Horn  at  Johns  Hopkins  developed  the  SOIL    SOU  has  been  developed  and 
promoted  both  as  a  case  mix  management  system  for  hospitals  (especially  teaching 
hospitals)  and  as  a  possible  refinement  to  the  DRG  system  for  use  in  PPS. 
A.  CLASSIFICATION  STRUCTURE. 

1.  son  classifies  patients  on  a  one  to  four  ordinal  scale  which  rates  overall 
burden  of  illness  rather  than  an  individual  disease  (Horn  et  al.  1984a).  A  specially 
trained  abstractor  assigns  the  SOU  based  on  a  review  of  the  whole  medical  record. 
The  rater  determines  the  SOU  by  "implicit  integration"  (Horn  et  aL  1983b)  of  seven 
severity  dimensions,  each  of  which  also  has  four  levels.  The  dimensions  are: 

Stage  of  the  principal  diagnosis  (not  the  same  as  the  Disease 

Staging  assignment), 

Complications  of  the  principal  diagnosis, 
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Concurrent  interacting  conditions  affecting  hospital  course, 

Dependency  on  hospital  staff, 

Extent  of  nonoperating  room  life  support  procedures. 

Rate  of  response  to  therapy  or  rate  of  recovery,  and 

Residual  impairment  (Horn  et  al.  1985b). 

In  some  applications,  each  level  of  the  SOU  is  further  divided  into  three  parts: 

without  an  operating  room  procedure,  with  a  moderate  operating  room  procedure, 

and  with  a  major  operating  room  procedure  (Horn  et  al.  1985b)  using  a  list  of  ICD-9- 

CM   procedure   codes.      The    12   categories   of   the   procedure-adjusted   SOU   do   not 

comprise  an  ordinal  scale  (Horn  et  al.  198^c). 

Assigning   the   SOU   requires   a   principal   diagnosis   (Horn   and   Horn    1985).      In 

practice,  the  SOU  is  usually  applied  to  patients  after  they  have  been  grouped  using  a 

disease  categorization  scheme  such  as  DRGs  (Horn  et  al.  1985b;  Horn  1985;  Horn  et 

al.  1985a;  Horn  and  Sharkey  1983).    Although  SOU  is  generic  and  not  disease-specific 

(Horn    et    al.    1985b),    SOU    levels    may    not    be    comparable    across    major    disease 

categories. 

2.     CSI.    Horn  (1986)  is  now  developing  the  CSI,  which  is  so  different  from  the 

son  that   it   is  grouped  with   it  only  for   convenience.      CSI   divides   all   ICD-9-CM 

diagnoses  into  about  700  categories,  and  there  is  a  specific  set  of  severity  criteria 

for  each  category.    The  criteria  use  clinical  findings  and  the  results  of  laboratory, 

X-ray,  and  electrocardiographic  exams  to  assign  a  severity  score  to  the  principal 

diagnosis  and  all  secondary  diagnoses.    A  coder  enters  the  diagnoses  into  a  computer 

and  the  computer  then  requests  the  specific  items  which  the  coder  must  abstract 

from  the  chart.    The  computer  then  uses  the  criteria  specific  to  each  diagnosis  to 

assign  a  severity  level  to  each  diagnosis.    Finally,  using  a  weighting  algorithm,  the 

program    assigns    an    overall    index    to    the    patient.       The    criteria    for    individual 
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diagnoses,  the  weights  assigned  to  the  criteria  in  giving  a  severity  score  to  an 
individual  diagnosis,  and  the  weighting  scheme  used  for  creating  an  overall  severity 
index  were  all  developed  on  clinical  grounds  but  without  interaction  between  actual 
data  and  the  developers. 

Horn  has  recently  entered  into  cooperation  with  Health  Systems  International,  a 
software  firm,  to  produce  the  software  for  CSI,  and  field  tests  were  scheduled  to 
begin  in  October  1986.  Whether  the  CSI  would  be  used  as  a  modification  of  DRGs  or 
in  conjunction  ^vith  its  own  diagnosis  categories  is  not  yet  clear. 

In  this  discussion  we  will  focus  on  SOU  with  some  comment  on  how  CSI  differs 

from  it. 

3.     VARIANTS.     The   published   studies  on  SOU  all  use  one  or  more  versions  of  a 

manual    scoring    system.       Within    manual    SOU,    modifications    have    been    almost 

continual    since    the    systetn    was    first    introduced.       The    major    variant    is    the 

"procedure-adjusted  SOU"  described  above. 

C.  DATA   REQUIREMENTS.     SOU  differs  from   the  other  five  systems  both  in  the 

type  of  data  input  it  requires  and  in  the  use  of  subjective  human  "integration"  to 

arrive    at    scores.      SOU    requires    that    a    specially    trained    abstractor    review    the 

medical  record  and  record  a  rating  on  seven  different  four-point  scales.    The  coding 

of  these  scales,  unlike  the  coding  of  the  data  elements  of  APACHE  or  MEDISGRPS, 

depends  not  on   locating  specific  clinical  findings  in   the  chart,  but  rather  on   the 

subjective  integration  by  the  abstractor  of  the  entire  content  of  the  record.    The 

developer  reports  that  the  abstraction  takes  a  trained  coder  less  than  3  minutes  per 

chart  (Horn   1985)  when  done  as  part  of  routine  chart  completion,  which  presumably 

means  that  uncomplicated  cases  can  be  coded  very  quickly.     For  a  hospital  with 

2,000    Medicare    cases    per    year,    this    represents    about    100    additional    hours    of 

abstractor  time  per  year.  Other   sources  estiinate  that  an  average  hospital   would 
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have  to  hire  two  additional  abstractors  (Nathanson  1985).  Horn  reports  that 
abstractors  can  be  trained  quickly. 

CSI  will  require  manual  abstraction  of  data  and  rater  judgment  as  to  whether 
medical  record  data  satisfy  the  rating  criteria;  only  the  combination  of  criteria  into 
codes  and  the  combination  of  codes  into  a  CSI  rating  is  computerized.  Thus,  the 
level  of  computerization  is  similar  to  that  of  MEDISGRP3  rather  than  that  of  DRGs, 
PMCs,  or  Staging.  CSI  is  so  different  in  concept  and  implementation  from  the 
manual  SOU  that  it  may  be  better  to  think  of  it  as  a  separate  system,  rather  than  as 
a  variation.  The  data  collection  burden  of  CSI  relative  to  SOU  is  not  yet  defined. 
D.  FACE  VALIDITY.  Horn  developed  SOU  as  an  alternative  to  DRGs  for 
determining  hospital  reimbursement  under  the  Maryland  prospective  payment  plan 
(Horn  1985).  Nevertheless,  SOU  was  not  created  as  a  direct  predictor  of  resource 
consumption  but  to  measure  the  patient's  burden  of  illness  indirectly  by  asking  "How 
sick  is  the  patient?"  (Horn  et  al.  1984b).  If  SOU  measures  this  burden  and  if  sicker 
patients  need  more  resources,  SOU  can  be  expected  to  predict  resource  need.  As  we 
have  already  observed  regarding  Staging  and  APACHE,  some  very  Ql  patients  will 
have  decreased  resource  needs,  but  this  should  not  create  greater  problems  for  SOU 
than  for  other  systems. 

The  seven  subscales  of  SOU  have  face  validity.     Rate  of  response  to  therapy, 

interacting  conditions,  dependency  and  complications  are  all  factors  which  should 

complicate  treatment  and  increase  costs.    But  the  objectivity  and  measurability  of 

these  dimensions  have   less  face   credibility.      The   concepts   are   imprecise,    it   is 

difficult  to  define  explicit  criteria  which  are  general  enough  to  apply  to  all  patients, 

and  training  manuals  which  are  supposed  to  solve  these  problems  are  not  available 

for  public  inspection. 

The  overall  score  is  most  often  used  in  reporting  data  about  the  SOU,  but  the 
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phrase  "implicit  integration"  (Horn  et  al.  1983a)  does  not  suggest  that  the  overall 
scale  is  highly  reliable.  More  recently  Horn  has  described  a  mathematical  formula 
for  forming  the  overall  score,  but  it  is  not  clear  whether  this  formula  simply 
describes  how  rating  usually  works  (Horn  et  al.  1985b)  or  prescribes  how  the  overall 
score  is  to  be  given  (Horn  and  Horn  1986). 

A  problem  with  the  face  validity  of  SOU  is  that  response  to  treatment  may 
reflect  either  the  patient's  illness  or  efficiency  of  treatment,  while  a  major  goal  of 
PPS  is  to  avoid  paying  for  inefficiency.  To  a  lesser  extent,  similar  arguments  apply 
to  use  of  life  support  systems  and  extra  nursing  care.  SOU  presents  a  special 
problem  because  published  data  do  not  allow  separation  of  scoring  based  on 
admission  condition  from  scoring  based  on  treatment.  The  SOU  strategy,  therefore, 
runs  the  risks  which  we  reviewed  in  Chapter  2  when  considering  payment  based  on 
inhospital  events.  By  contrast,  CSI  applies  to  a  point  in  time  and  thus  can  be 
measured  at  admission,  during  the  stay,  or  at  discharge. 

Until  CSI  is  available  for  review,  no  independent  judgment  about  its  face 
validity  or  other  characteristics  is  possible. 

E.  AVAILABILITY  OF  DOCUMENTATION.  While  much  has  been  written  about  SOU, 
almost  all  of  the  reports  to  date  have  come  from  the  original  developer  of  the  the 
index  (Horn,  et  al.,  1983b;  Horn,  et  al.,  1983a;  Horn  and  Sharkey  1983;  Horn,  et  al., 
1984a;  Horn  et  aL  1985a;  Horn  et  aL  1985b).  Because  the  system  depends  on  the 
ability  of  the  abstractors  to  reliably  code  the  SOU  score,  the  key  to  implementing 
the  system  is  the  training  of  the  abstractors.  Only  limited  documentation  on 
abstractor  training  is  publicly  available,  and  no  material  is  publicly  available  which 
would  allow  objective  resolution  of  conflicts  over  SOU  assignment. 

The  CSI  is  not  yet  documented. 
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F.  SUBJECTIVITY,  VULNERABILITY  TO  MANIPULATION,  AND  RELIABILITY. 
The  SOU  relies  far  more  than  other  case  mix  methods  on  subjective  and  implicit 
judgments  by  the  coder,  a  problem  which  is  not,  as  we  have  pointed  out,  relieved  by 
high  reported  reliability.  Hospitals  which  have  used  SOU  have  been  "volunteers" 
which  adopted  it  for  internal  management.  Medical  records  personnel  have  had 
strong  incentives  not  to  manipulate  the  system  because  they  work  for  a  hospital 
management  which  adopted  the  system  in  order  to  identify  problems  and 
discrepancies.  This  incentive  would  be  absent  if  hospital  management  wished  to 
obtain  maximum  reimbursement.  The  degree  to  which  the  system  could  be  audited 
is  unclear  because  of  limited  publicly  available  documentation.  SOU  has  functioned 
to  date  by  using  Horn  and  her  staff  as  the  final  arbiters  of  correct  coding,  but  that 
solution  appears  impractical  for  a  national  payment  system. 

More  information  has  been  published  on  reliability  of  SOU  than  on  any  of  the 
other  systems  being  reviewed.  The  reported  levels  of  agreement  between  expert 
coders  and  hospital  based  coders  are  high  under  Horn's  supervision:  between  88  and 
93  percent  (Table  3.1).  These  statistics  are,  however,  uncorrected  for  chance 
agreement  between  raters.  Although  the  frequency  of  agreement  of  SOU  raters  may 
be  better  (Horn  and  Horn  1986)  than  that  for  diagnostic  coding,  there  is  a  difference 
between  agreement  when  there  are  four  choices  (SOU)  and  when  there  are  10,000 
choices  (ICD-9-CM).  Horn  maintains  that  raters  should  be  given  credit  for  chance 
agreement  in  calculating  reliability,  but  most  other  investigators  do  not  agree. 

The   only   reported   use  of  SOU    under   supervision   other    than    Horn's    is   the 

Graduate    Medical  Education  Study   (Arthur   Young   Co.   1983).      In   a  preliminary 

analysis  of  a  10  percent  sample  of  this  data,  Richards  et  al.  (1986)  found  that  71.8 

percent  of  SOU  ratings  were  1;  23.1  percent  2;  3.0  percent  3;  and  2.2  percent  4.    Of 

577  cases  rated  more  than  once,  132  were  rated  twice,  417  three  times,  and  28  four 
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times.  There  was  complete  agreement  on  65.5  percent  of  cases,  a  disagreement  of 
one  point  on  32.0  percent  of  cases,  and  a  disagreement  of  more  than  1  point  on  2.6 
percent.  This  corresponds  to  an  interrater  reliability  of  78.9  percent,  which  is 
significantly  lower  than  Horn  has  reported.  Because  of  the  very  large  fraction  of 
cases  rated  1,  we  would  expect  raters  to  agree  by  chance  alone  in  57.0  percent  of 
cases,  so  the  observed  reliability  means  that  raters  agreed  in  only  50.9  percent  of 
cases  where  they  would  not  have  done  so  by  chance. 

None  of  this  analysis  provides  guidance  on  the  most  critical  question,  which  is 
the  impact  of  reliability  issues  on  the  payments  which  would  be  made  if  SOU  were 
used  for  PPS.  Such  data  are  not  available  and,  as  Lloyd  and  Kissing  (1985)  have 
shown  for  DRGs,  a  relatively  high  error  rate  can  have  small  impact  on  payment. 
Conversely,  in  a  system  such  as  SOU,  where  payment  differences  between  SOU 
scores  are  large,  the  impact  of  moderate  unreliability  could  also  be  large. 

There  is  no  information  yet  on  the  reliability  of  the  CSI.  Since  it  is  based  on 
adding  a  sixth  digit  to  the  1CD-9-CM  codes,  it  presumably  shares  some  of  the  same 
problems  as  DRG's,  Staging,  and  other  UHDDS  systems  as  well  as  problems  specific 
to  CSI.  On  the  other  hand,  the  attention  which  CSI  directs  to  specific  physiologic 
criteria  could  lead  to  more  accurate  diagnoses. 

G.  PREDICTIVE  PERFORMANCE.  SOU  research  has  been  reported  on  a  data  set 
consisting  of  70  percent  of  the  discharges  from  15  "primarily  large  teaching 
hospitals"  (Horn  et  al.  1984b;  Horn  et  al.  1985b).  The  abstractors  were  hospital 
employees  who  were  trained  under  Horn's  supervision,  including  regular  follow-up 
reliability  checks. 

1.  Performance  at  the  Case  Level.    Severity  procedure-adjusted  DRGs  explain 

61  percent  of  the  variance  in  resource  use  in  the  15  hospitals  (Table  3.11),  while 

DRGs  alone  explained  only  28  percent  of  the  variance  in  the  same  sample.     The 
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difference  in  performance  between  DRGs  and  SOU  is  probably  somewhat  smaller 
than  it  appears  because  the  statistics  appear  not  to  have  been  adjusted  for  the 
smaller  cell  size  in  the  SOU  measurements.  Severity-procedure-adjusted  DRGs 
appear  to  give  more  weight  to  procedures  actually  performed  than  do  current  DRGs, 
but  the  use  of  operating  room  procedures  to  predict  costs  is  already  accepted  in  the 
DRG  system. 

No  data  are  available  yet  on  CSI  and  its  performance,  either  at  the  case  or 
hospital  level. 

2.  Performance  at  the  Hospital  Level.  In  one  study,  Horn  (1985b)  converted 
charges  to  operating  cost  per  case  and  carried  out  a  hospital  level  analysis 
comparing  DRGs  to  SOII-procedure-adjusted  DRGs.  In  the  15  hospitals  in  this 
sample,  SOII-procedure-adjusted  DRGs  explained  81  percent  of  the  variance,  which 
was  not  significantly  better  than  the  75  percent  of  variance  explained  by  DRGs 
alone.  Horn  reported  that  SOII-procedure-adjusted  DRGs  would  come  substantially 
closer  to  paying  actual  hospital  costs  than  current  DRGs  but  the  methods  for 
determining  some  of  these  payment  implications  were  unclear. 

VII.  THE  VALIDITY  OF  COST  PREDICTORS 

In  this  section  we  examine  the  accumulated  evidence  on  the  validity  of  certain 

predictors   of  need  for  care,   independent  of  the  particular  system   in  which  the 

predictors  are  imbedded.    The  validity  of  predictors  is  important  both  because  it 

gives  construct  validity  to  a  system  constructed  from  these  predictors  and  because 

it  gives  us  guidance  for  future  system  construction.    Valid  predictors  are  also  likely 

to  lead  to  a  clinically  reasonable  system,  which  is  important  for  two  reasons: 

A  clinically  reasonable  system   is  likely  to  retain  its  power  even  when 

relative  costs  of  treatments  change  and  even  as  clinical  practices  and 
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technology  evolve. 

A    clinically    reasonable    system    is    more    likely    to    be    acceptable    to 

administrators    and    physicians    and    therefore    less    likely    to    stimulate 

gaming. 

A.  DIAGNOSIS.    Principal  diagnosis  is  not  a  strong  predictor  of  use  of  services,  as  is 

clear   from    the   weak  performance  of   DRGs,  Staging,  and   PMCs   for   nonsurgical 

discharges  (Tables  2.4  and  3.12).    In  DRGs,  which  rely  heavily  on  principal  diagnosis, 

nonsurgical  DRGs  account  for  about  10  percent  of  variation  in  resource  use.    Staging 

uses  patterns  of  diagnosis  and  performs  slightly  better  than  diagnoses  as  used  in 

DRGs.    PMCs,  which  use  procedures  somewhat  less  than  do  DRGs,  do  about  as  well 

as  DRGs  (Table  3.12).     On  the  whole,  evidence  from  these  systems  suggests  that 

considering  patterns  of  principal  and  secondary  diagnoses   may  be  slightly    more 

powerful    than    the    DRG    strategy    of   resting   assignment   almost   wholly    on    the 

principal  diagnosis  and  relying  on  a  uniform  list  of  secondary  diagnoses  which  will  be 

considered  comorbidities  and  complications.    On  the  other  hand,  there  is  no  evidence 

whether  ignoring  the  order  in  which  diagnoses  are  listed  is  a  profitable  strategy. 

This  approach  may  decrease  opportunities  to  manipulate  the  sequence  of  diagnoses 

while   increasing  opportunities   to   manipulate    the   system    by   adding   inactive   or 

marginal  secondary  diagnoses.    Work  in  progress  at  Systemetrics  will  push  the  limits 

of   combining   Staging    with    DRGs.       A    recently    funded    cooperative    agreement 

between  HCFA  and  Robert  Fetter  at  Yale  proposes  to  develop  lists  of  comorbidities 

and  complications  specific   to  each  principal  diagnosis,   which   might  make  DRGs 

more  powerful. 

One  approach  to  the  sequencing  problem  is  to  list  the  "primary"  diagnosis  first— 

that    diagnosis    which    accounts    for    the    greatest    use    of    resources    during    the 

hospitalization.      The  Veterans'   Administration  has  adopted   this  strategy,   which 
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specifically  addresses  the  anomaly  of  a  patient  who  develops  a  complication  during 
the  admission.  The  classic  example  is  the  patient  who  is  admitted  for  cataract 
surgery  and  has  a  heart  attack.  But  this  strategy  does  not  solve  the  other  problems 
of  sequencing  or  of  diagnostic  overlap. 

To  some  extent  the  weak  performance  of  diagnosis  results  from  a  diagnostic 
system  (ICD-9-CM)  which  generally  focuses  on  the  pathology  of  disorder  and  its 
location  rather  than  the  severity  of  the  disorder  or  its  degree  of  progression 
(Worthman  and  Cretin  1986  and  Averill  1986  discuss  this  limitation  of  ICD-9-CM  in 
detail).  But  the  weak  performance  of  diagnosis  also  reflects  the  lack  of  information 
as  to  the  purpose  of  admission. 

B.  SURGICAL  PROCEDURES.  Surgical  procedures  have  considerable  predictive 
power,  for  reasons  described  in  the  last  chapter.  In  combination  with  diagnosis,  they 
typically  account  for  30  percent  of  variation  in  resources  used  among  all  patients 
and  30-50  percent  of  variation  among  patients  receiving  surgery.  Further  use  of 
procedures  may  further  improve  explanation  of  cost  variation  (Mitchell  et  al.  1985b, 
Cromwell  et  al.  1985),  but  this  strategy  risks  an  increasing  return  to  cost-based 
reimbursement  (Chapter  2). 

C.  SEVERITY  OF  ILLNESS  ON  ADMISSION.    Evidence  as  to  the  explanatory  power 

of  severity  of  illness  on  admission  is  somewhat  limited.    Within  certain  groups  (e.g., 

ICU   patients),    severity    measures  perform    moderately   well  (Wagner  and   Draper 

1984),  accounting  for  17  percent  of  the  variance  in  resource  use.    On  the  other  hand, 

Coulton  et  al.  (1985)  found  very  uneven  results  when  applying  physiologic  measures 

within  DRGs  to  a  mixture  of  ICU  and  non-ICU  patients.    For  certain  conditions  such 

as  burns,  multiple  trauma,  and  low  birthweight,  the  correlation  of  severity  with  cost 

of  care   is   very   clear   for   patients   who  survive.      There   is  some    evidence    that 

admission  severity  measures  do  well  across  all  patients  (Brewster  et  al.,  1985a,b). 
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The  best  publicized  severity  studies  are  of  SOU  (Horn  et  ai.,  1985a,  1985b),  but  these 

mingle    severity    on    admission    with    severity    during    hospital    stay    so    that    the 

predictive  power  of  admission  severity  cannot  be  estimated.     Horn's  methods  are 

controversial  (Davis  1985,  MuUin  1985,  Horn  et  al.,  1985c).    Nevertheless,  there  is 

now  significantly  more  evidence  than  3  years  ago  that  clinical  severity  of  illness  on 

admission  is  strongly  associated  with  overall  costs  of  care.      This  same  data,  in 

addition  to  unpublished  work  by  Moskowitz  and  Ash  at  Boston  University,  suggest 

that  there  are  differences  in   average   severity   across   hospitals   which   would   be 

expected  to  lead  to  differences  in  cost.    The  work  of  Munoz  (1985,  1986),  which  is 

discussed  below,  is  consistent  with  this  evidence.    Generally,  however,  the  variation 

in  average  admission  severity  across  hospitals  appears  to  be  for  smaller  than  the 

variation  in  admission  severity  at  the  individual  case  level. 

Admission  severity  measures  may  become  easier  with  time.     As  hospitals  put 

more  patient  information  on  computer  systems,  a  system   like   APACHE,   CSI,  or 

MEDISGRPS  might  be  redesigned  to  eliminate  the  need  for  special  abstraction  forms 

and  to  abstract  the  necessary  data  from   patient  data  files.     Such  technological 

changes  in  storage  and  retrieval  of  medical  record  information  could  dramatically 

reduce   the   marginal   cost   of    implementing  case    mix   adjustment    methods    using 

physiologic  or  other  clinical  data. 

D.  PSYCHOSOCIAL  VARIABLES.     Some  authors  have  suggested  that  psychosocial 

variables  such  as  poverty,   living  alone,   being  disabled,  or   being  very  old   would 

increase  a  patient's  need  for  care  within  a  DRG  (Stern  and  Epstein  1985).    This  is 

consistent    with    findings    (Gordon    1985;    Altman    1985)    that    hospitals    with    a 

disproportionate  share  of  Medicaid  patients  have  higher  costs  per  admission  than 

other  hospitals  with  similar  characteristics.    To  examine  this  question,  3encks  and 

Kay  (1986)  examined  the  influence  of  a  number  of  beneficiary  characteristics  on 
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hospital  charges,  holding  both  hospital  and  case  mix  constant.  They  found  that,  in 
the  most  frequent  DRGs,  average  charges  for  beneficiaries  with  Medicaid  eligibility, 
previous  hospitalization,  age  (  80),  extreme  age  (  85),  disability,  and  end-stage  renal 
disease  as  well  as  beneficiaries  admitted  from  a  nursing  home  were  all  within  6 
percent  of  charges  for  other  beneficiaries  in  the  same  DRG  (Table  3.14).  While 
charges  were  higher  for  both  nursing  home  residents  and  the  aged,  the  two  effects 
did  not  interact — charges  for  aged  nursing  home  residents  were  no  higher  than  those 
for  other  nursing  home  residents.  While  these  findings  do  suggest  that  more  careful 
adjustments  for  age  may  add  some  precision  to  DRGs,  they  do  not  support  the  hope 
that  major  improvements  can  be  achieved  by  incorporating  sociodemographic 
variables  in  the  DRG  system. 

This  comparison  of  patients  within  hospitals  does  not  contradict  findings  that 
hospitals  with  a  disproportionate  share  of  Medicaid  beneficiaries  have  higher  costs 
than  similar  hospitals  with  fewer  Medicaid  beneficiaries  and  does  not  mean  that  the 
higher  costs  of  disproportionate  share  hospitals  are  unwarranted.  These  findings  do 
suggest  that  having  a  disproportionate  share  of  Medicaid-eligible  beneficiaries  is  a 
proxy  for  hospital  and  patient  characteristics  which  result  in  higher  costs  rather 
than  specific  higher  costs  associated  with  Medicaid-eligible  beneficiaries. 

E.  REFERRAL.  Patients  referred  or  transferred  from  other  hospitals  appear  to 
have  higher  costs.  Morrison  et  al.,  (1985)  found  that  psychiatric  patients  received  in 
transfer  had  charges  about  four  times  higher  than  direct  hospital  admissions.  Work 
in  progress  (Bobula  and  Cotterill  1986)  shows  that  hospitals  receiving  transfers  have 
higher  costs  and  that  hospitals  sending  transfers  have  lower  costs  compared  to  the 
average  hospital. 

F.  PURPOSE  OF  TREATMENT.   Purpose  of  treatment  is  rarely  even  coded,  and  data 

linking  purpose  of  treatment  to  costs  are  sparse.    Garber  et  al.,  (1984),  in  a  study  of 
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the  difference  between  teaching  and  nonteaching  services  at  a  university  hospital, 

found  that,  when  patients  were  matched  at  admission,  the  mortality  was  lower  and 

costs  were  higher  on  the  teaching  service.    These  data  did  not  permit  distinguishing 

quality  of  care  from  intent  of  care,  but  other  evidence  (including  equal  mortality  at 

9-month  follow-up)  suggested  that  the  difference  might  be  in  therapeutic  intent  to 

keep  dying  patients  alive. 

G.  KNOWLEDGE  ABOUT  THE  PATIENT  AT  ADMISSION.    Munoz  et  al.,  (1985,  1986) 

found  higher   costs    for    emergency   room    admissions   in   a   sample   of   New    York 

hospitals.     He  argued  that  need  for  inpatient  diagnostic  services  in  these  patients 

contributed  to  increased  costs,  but  this  effect  could  not  be  separated  from  possible 

differences  in  severity  of  illness.    Averill  (1986)  has  suggested  that  diagnostic  effort 

is  an  important  determinant  of  cost  after  severity  of  illness  has  been  considered,  but 

there  are  no  other  studies  to  confirm  that  knowledge  of  the  diagnosis  at  admission 

predicts  cost. 

H.    IMPROVED    CODING.      An    improved    diagnostic   coding   system    might   better 

predict  costs,  since  many  of  the  case  mix  systems  we  have  examined  are  really 

coding  systems.     ICD-9-CM  was  not  designed  to  code  either  severity  of  illness  or 

acuity   of   presenting   symptoms,    and   a   system    which    included    either    of    these 

variables  might  work  better.     Horn's  CSI  system  seeks  to  accomplish  this  goal. 

Authors  who  have  criticized  ICD-9-CM  have  suggested  refining  ICD  on  which  it  is 

based,  but  the  next  revision  of  ICD  is  due  in  1995  and  the  lead  time  is  so  great  that 

turning   the  revision   toward   measuring   severity   may   not  be  possible   even  now. 

Furthermore,  substantial  changes  in  the  ICD  system  might  create  serious  conflict 

with  epidemiologic  uses  of  the  system   which  require  consistency  from  decade  to 

decade. 

Advocates  of  better  coding  have  also  pointed  out  that  the  procedure  coding  of 
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ICD-9-CM  differentiates  procedures  by  purpose  rather  than  by  what  was  done  so 
that  an  endoscopic  biopsy  and  a  biopsy  by  operating  room  surgery  can  receive  the 
same  code.  They  suggest  using  a  procedure  code  such  as  CPT-^  or  HCPCS  instead 
of  ICD-9-CM.  Nobrega  et  al.,  (1985)  has  shown  that  such  a  change  is  probably 
feasible,  but  there  are  no  data  yet  to  show  that  it  would  improve  the  predictive 
power  of  the  system. 

VIII.  THE  FEASIBILITY  OF  ALTERNATIVE  CASE  MIX  SYSTEMS  FOR  PPS 

In  this  section,  we  summarize  the  ability  of  alternative  systems  to  replace 
DRGs,  to  refine  DRGs,  and  to  contribute  to  further  research. 

A.  PERFORMANCE  OF  THE  ALTERNATIVES  AS  DRG  SUBSTITUTES.  Examining 
the  DRGs  as  case  mix  measurement  tools  raised  concern  as  to  their  adequacy,  but 
examining  the  alternatives  underlines  the  substantial  strengths  of  the  DRG  system. 
The  basic  criticism  of  DRGs  is  that  they  do  not  measure  case  mix  well  enough,  but 
they  are  widely  accepted  as  a  "first  cut,"  an  acceptance  illustrated  by  the  degree  to 
which  they  are  taken  as  a  point  of  departure  for  refinement  systems. 

By  contrast,  only  two  systems  are  plausible  replacements  for  DRGs.  SOU, 
MEDISGRPS  and  APACHE  all  require  a  diagnostic  framework  within  which  the 
severity  level  or  score  can  be  applied  and  could  not  by  themselves  replace  DRGs. 
CSI  aspires  to  be  a  potential  replacement,  but  has  not  developed  to  a  point  where 
this  potential  can  be  evaluated. 

Computerized  Staging  and  PMCs  are  plausible  substitutes  for  DRGs.  Both  are 
reasonably  complete  case  mix  adjustment  systems,  which  have  been  developed  and 
documented  to  the  point  that  users  other  than  the  initial  developers  can  implement 
the  systems.    Coffey  and  Goldfarb  (1984)  compared  Staging  to  DRGs;  Calore  (1985) 
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compared  Staging  and  PMCs.   Two  findings  emerge  from  these  studies. 

First,  even  though  these  two  case  mix  methods  are  reasonably  complete  and 
documentation  is  available,  they  are  not  complete  in  the  sense  that  DRGs  are 
complete.  The  PMC  software  does  not  classify  5  (Young  1985)  to  11  (Galore  1985) 
percent  of  cases.  The  PMC  approach  to  patients  with  multiple  patient  management 
assignments  requires  accepting  the  structure  of  the  normative  PMC  weights,  and  no 
data  are  available  on  performance  of  this  approach.  Staging  now  classifies  almost 
all  patients,  but  no  mechanism  is  available  to  handle  patients  with  multiple 
assignments  and  the  risk  of  discrimination  against  referral  surgical  centers  appears 
substantial. 

The  second  finding  is  that  neither  Staging  nor  PMCs  is  better  than  DRGs  (Table 
3.10).  This  is  particularly  true  of  Staging,  which  actually  explained  about  two-thirds 
of  the  variation  in  costs  explained  by  DRGs.  The  difference  in  performance 
between  DRGs  and  the  other  two  systems  clearly  reflects  the  use  of  surgical 
procedures  in  assigning  patients  to  DRGs:  both  Staging  and  PMCs  perform  about  as 
well  as  DRGs  when  cases  have  already  been  classified  as  surgical  or  non-surgical 
(compare  Tables  2.4  and  3.12). 

Even  though  PMCs  and  Staging  do  not  outperform  DRGs,  they  might  do  better 

on  specific  tasks  such  as  explaining  the  higher  costs  of  tertiary  care  or  teaching 

hospitals.    Butler  and  Bentley,  (1982)  saw  Staging  as  a  system  that  would  be  better 

than   DRGs  for   teaching  and  tertiary  care   hospitals  (Butler  and  Bentley,   1982). 

However,  when  Coffey  (1985)  further  analyzed  the  differences  in  the  performances 

of  Staging  and  DRGs,  Staging  did  not  explain  the  higher  costs  of  teaching  hospitals. 

Hospitals  with  a  difficult  surgical  load  also  appear  to  lose  under  Staging,  compared 

to  DRGs.    Pending  further  work  that  shows  some  new  advantages  of  Staging,  there 

does  not  seem  to  be  a  compelling  reason  to  replace  DRGs  with  Staging. 
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PMCs  and  Staging  may  have  some  structural  advantages  over  DRGs  for 
reimbursement  because  they  generally  do  not  rely  on  procedures  (and  therefore 
services  rendered)  for  classification.  It  is  especially  promising  that,  using  only 
UHDDS  data,  PMCs  were  able  to  perform  as  well  as  they  did  without  a  heavy 
reliance  on  procedures.  In  addition,  the  concept  behind  PMCs—grouping  together 
patients  on  the  basis  of  their  needed  components  of  care — comes  closer  to  the  goals 
of  PPS  than  the  approach  used  in  DRGs—grouping  patients  based  on  actual  resources 
used.  Nevertheless,  unless  PMCs  produce  fairer  payment  than  DRGs  and  do  not 
discriminate  against  referral  surgical  centers,  there  is  no  compelling  reason  to  put 
them  in  place  of  DRGs. 

B.  POTENTIAL  OF  THE  ALTERNATIVE  SYSTEMS  FOR  REFINING  DRGs.    All  five 
systems  might  be  useful  in  refining  DRGs. 

1.    Staging.        Systemetrics    (Conklin    et    al.    1984a,     1984b;    Conklin    1985; 

unpublished    material)   has   developed   refinements   for   about   60    DRGs   based   on 

Staging.     These  refinements  rest  on  staging  the  principal  and  secondary  diagnoses, 

assigning  a  weight  to  each  secondary  diagnosis  based  on  regression  analysis,  and  then 

forming  subdivided  DRGs.      Although  the  computer  programming  is  complex,  the 

result    is    simply    a    somewhat    larger    number    of    DRGs    in    which    comorbid    and 

complicating  diagnoses  are  used  with  greater  sophistication.     Preliminary  results 

suggest  that  this  approach  increases  the  ability  of  the  DRG  system  from  about  36 

percent     of    adjusted    charges     to    about    41     percent    of    adjusted    charges — an 

improvement  roughly  comparable  to  that  achieved  by  MEDISGRPS  but  without  need 

for  special  chart  abstracting.    Work  on  an  earlier  version  of  this  refinement  (Table 

3.13;   Cretin  and  Worthman  1986)  suggests  that  this  refinement  would  measure  a 

somewhat   more  complex  case  mix  in  teaching  hospitals  and  inner  city  hospitals. 

Because  this  refinement  rests  on  use  of  secondary  diagnosis  while   MEDISGRPS  is 
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generic  to  all  diagnoses,  the  two  approaches  should  be  complementary  rather  than 
directly  competitive  and  might  do  well  in  combination.  Some  6  to  12  months  of 
additional  work  will  be  needed  to  confirm  the  initial  findings  and  to  define  the 
interaction  of  Staging-based  DRG  refinements  with  MEDISGRPS. 

2.  son.  The  developer  of  the  SOU  has  strongly  advocated  using  SOU  to  modify 
^  DRGs.  However,  the  subjective  nature  of  the  system  and  the  absence  of  public 
training  materials  and  formal  methods  to  resolve  conflict  raise  very  serious 
obstacles  to  this  strategy.  Even  the  superficially  objective  DRG  system  suffered 
from  an  unexpected  amount  of  "creep";  the  more  subjective  SOU  could  well  create 
more  serious  problems.  The  logistics  of  implementing  and  policing  a  manually 
coded,  subjectively  assessed  national  system  would  be  extremely  formidable.  Horn's 
current  focus  on  developing  the  more  objective  CSI  suggests  that  she  recognizes  the 
importance  of  the  subjectivity  issue. 

An  equally  serious  problem  is  that  SOU  has  been  examined  only  in  15  primarily 
large,  teaching  hospitals,  all  of  which  had  CMIs  above  1.00  (Horn  et  al.  1985b). 
There  is  no  evidence  as  to  the  redistributive  effects  of  an  SOII-based  refinement 
among  different  classes  of  hospitals.  When  Disease  Staging  was  tested  only  in 
teaching  hospitals,  it  appeared  to  capture  the  range  and  severity  of  cases  (Butler 
and  Bentley  1982),  yet  when  Staging  was  tested  on  statewide  data,  it  did  not  appear 
to  be  an  improvement  over  DRGs  (Coffey  and  Goldfarb,  198^^). 

CSI  is  designed  to  remedy  many  of  the  obstacles  to  use  of  SOU,  but  assembling 
a  national  data  base  to  test  the  proposed  system  will  be  both  expensive  and  time- 
consuming.  Speculating  on  the  suitability  of  CSI  as  a  refinement  or  replacment  for 
DRGs  is  clearly  premature. 

SOU   may   be    most    valuable  as  a  research  and  hospital   management   tool  if 

technical  questions  regarding  reliability  measures  can  be  resolved  and  if  the  risk  of 
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coding  actual  care  instead  of  needed  care  can  be  shown  to  be  controllable. 

3.  PMCs.  PMCs  might  be  used  to  refine  DRGs  somewhat  as  Staging  has  been 
used  by  Systemetrics.  The  fact  that  the  two  systems  are  very  different  in  structure 
is  less  of  an  obstacle  to  such  a  strategy  than  the  inadequate  current  state  of  PMC 
documentation.  Katherine  Galore  at  the  Center  for  Health  Economics  Research  has 
started  to  explore  this  approach. 

li.  APACHE.  APACHE  has  demonstrated  ability  to  predict  costs  in  the 
critically  ill,  and  it  might  be  especially  useful  in  paying  for  outlier  cases,  where  data 
costs  would  be  moderated  by  the  small  number  of  cases  involved.  This  approach 
corresponds  to  concerns  that  hospitals  might  tend  to  avoid  critically  ill  patients  who 
have  a  high  risk  of  becoming  outliers.  However,  current  data  to  assess  this  approach 
are  not  available. 

5.  MEDISGRPS.  MEDISGRPS  could  improve  the  DRG  system  somewhat,  but 
evidence  that  it  can  make  payment  more  accurate  must  await  collection  of  new 
data,  such  as  that  now  being  collected  by  the  PROs  for  HCFA's  Health  Standards 
and  Quality  Bureau.  The  data  burdens  of  MEDISGRPS  for  Medicare  patients  are  not 
yet  fully  defined. 

6.  Conclusions  Regarding  DRG  Refinements.  Two  types  of  refinements  to 
DRGs  can  be  considered:  those  which,  like  DRGs,  only  require  UHDDS  elements  to 
operate  and  those  which  would  require  the  expansion  of  UHDDS  to  Include  reason 
for  admission,  physiologic  test  results,  or  other  elements.  In  the  short  run,  UHDDS- 
based  modifications  are  easiest  to  implement  quickly. 

The  Staging-based  modifications  to  DRGs   are  the   most  highly  developed  of 

available   modifications.      These   refinements   can   be   made   using  data  which  are 

currently   collected   but   would    require    significant    modifications   to   the   Grouper 

software     which     assigns     DRGs.        They    appear     to     appreciably     increase     the 
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classification  power  of  DRGs,  but  their  impact  across  hospitals  is  uncertain. 
Evaluation  of  this  modification  using  actual  cost  data  from  the  first  year  of  PPS 
appears  to  be  desirable. 

Other  promising  routes  for  DRG  improvement  include  more  precise  adjustment 
for  age,  adjustments  for  transfers,  inclusion  of  whether  a  patient  came  from  a 
^  nursing  home,  and  use  of  a  better  list  of  diagnoses  for  comorbidities  and 
complications.  The  possibility  that  emergency  room  admissions  are  more  expensive 
is  intriguing  but  it  is  difficult  to  see  how  this  could  be  used  to  determine  payment 
without  inviting  manipulation. 

Of  the  remaining  systems  (APACHE,  MEDISGRPS,  SOU  and  PMCs),  APACHE 

seems    closest    to   being    implementable.       MEDISGRPS    (with    its    extensive    data 

requirements)  and  the  new  CSI  (with  its  requirement  for  implementing  a  new  system 

of  diagnosis  and  procedure  coding)  are  not  yet  fully  developed.    SOU,  with  its  labor 

intensive,  subjective  measures,  may  be  useful  in  research  and  hospital  management 

but  appears  unready  for  use  in  payment. 

C.  WORK  IN  PROGRESS  AND  FUTURE  RESEARCH  NEEDS.    As  other  parts  of  this 

report  suggest,  a  critical  problem  in  DRG  refinement  is  that  we  do  know   many 

problems  in  the  current  system  but  we  do  not  know  which  are  most  important.    If 

problems   are   concentrated   in   outliers  (Chapter   4. VI),    then   systems    with   broad 

categories  are  likely  to  be  relatively  ineffective  improvements.     Results  to  date 

with  Staging  suggest  that  this  is  the  case.      If  problems  reflect  differences  in  the 

purpose  for  which  the  patient  is  treated  rather  than  the  patient's  condition,  then 

new  approaches  to  classification  are  needed  and  detailed  attention  to  certain  kinds 

of  procedures  are  in  order.    If  the  difficulties  result  from  differences  in  the  degree 

of    diagnostic    and    therapeutic    aggressiveness    between    hospitals    and    between 

physicians,   then   Medicare   will  need  either  to  pay   for  these  differences  without 
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determining    their    efficacy    or    to    find    ways    to    pay    for    outcomes   rather    than 
treatments. 

1.  Work  in  Progress.  A  number  of  studies  now  in  progress  will  yield  significant 
clues  as  to  the  relative  performance  of  a  number  of  currently  available  measures. 
The  Graduate  Medical  Education  study  will  not  only  compare  the  performance  of 
SOU  and  manual  Staging  (both  now  somewhat  dated  systems),  but  will  look  at  the 
interplay  between  case  mix,  quality,  cost,  and  appropriateness  of  care  in  a  selected 
group  of  hospitals.  Ashcraft  and  Thompson,  at  the  University  of  Michigan,  are 
pursuing  case  mix  evaluation  in  a  consortium  of  teaching  hospitals.  Alemi,  at 
Tulane,  is  looking  at  the  measurement  of  severity  in  myocardial  infarction  cases, 
lezzoni,  at  Boston  University,  is  examining  case  mix  measures  across  hospital  types. 
HCFA  is  sponsoring  research  at  the  Brandeis  and  Rand/UCLA  Health  Care  Financing 
Research  Centers  to  review  the  adequacy  of  the  current  case  mix  adjustment, 
looking  for  evidence  of  skimming  and  dumping,  regional  practice  variations,  and 
appropriateness  of  care. 

These  studies  should  provide  significant  guidance  as  to  how  much  information 
physiologic  variables  can  add  to  case  classification  and  supply  guidance  for  possible 
large-scale  studies  involving  primary  data  collection. 

2.  ORG  Refinement  Using  Staging  and  PMCs.  Conklin  (1985)  has  already 
demonstrated  that  Staging  and  DRGs  can  be  synthesized  to  produce  a  system  which 
is  somewhat  more  powerful  than  current  DRGs  without  requiring 

additional  data  collection  and  SysteMetrics  is  continuing  work  in  this  area.    Young 

has  viewed  PMCs  as  a  system  so  different  from  DRGs  that  synthesis  is  not  possible, 

but   PMCs   may   nevertheless  provide   useful  tools   in   refining  parts  of  the   DRG 

system.    Calore  and  colleagues  at  the  Brandeis  Center  are  exploring  this  approach. 

Although  the  improvements  promised  by  these  strategies  appear  to  be  modest,  such 

3-38 


improvements  also  appear  to  be  appropriate.  Such  steps  appear  to  be  more  useful 
than  comparisons  of  the  existing  DRG,  Staging,  and  PMC  systems  on  larger  data  sets 
since  PMCs  and  Staging  do  not  appear  to  provide  sufficient  advantages  to  consider 
using  them  as  replacements  for  DRGs. 

3.  Sequencing  Diagnoses  and  Improved  Comorbidity/Complication  Lists.  A  wide 
variety  of  evidence  indicates  that  sequencing  of  diagnoses  is  inherently  subject  to 
ambiguity  and  error.  Likewise,  the  lists  of  comorbidities  and  complications  are 
oversimplified  and  nonspecific.  Another  source  of  work  in  this  area  is  a  cooperative 
agreement  between  HCFA  and  Robert  Fetter  at  Yale  University,  one  of  the 
developers  of  the  DRG  system.  This  project  also  seeks  to  refine  the  use  of 
comorbidities  and  complications  in  the  DRG  system,  using  an  electic  approach  which 
draws  on  a  number  of  sources.  The  award  was  made  in  August  1986,  and  the  project 
is  still  in  an  early  stage. 

^.  DRG  Refinement  Using  Physiologic  Measures.  Studies  in  progress  will 
provide  considerable  evidence  on  use  of  physiologic  measures,  but  none  will  provide 
definitive  information.  This  is  due,  in  part,  to  the  fact  that  the  systems  are 
evolving  so  rapidly  so  that  new  versions  will  almost  certainly  be  in  place  by  the  time 
results  from  the  current  studies  are  analyzed.  A  second  problem  is  that  each  system 
is  developed  and  tested  on  a  limited  data  set,  so  that  cross-comparisons  are 
difficult.  Nevertheless,  further  information  on  whether  incorporating  physiologic 
measures  into  PPS  is  desirable. 

A  study  on  a  nationally -representative  group  of  hospitals  and  patients  can 
give  a  picture  of  the  payment  impact  of  applying  physiologic  measures  to 
PPS  payment.  There  are  insufficient  data  today  to  predict  the  payment 
impact  of  any  of  the  systems  which  do  not  rest  on  UHDDS. 
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A  collection  of  physiologic  data  and  corresponding  costs  will  be  necessary 
in  order  to  determine  appropriate  payments  when  physiologic  measures 
are  used  for  payment.    There  is  currently  no  source  for  such  "calibration 
data." 
Designing  a  comparison  requires  approximate  answers  to  three  questions: 

Should  Medicare  seek  to  adopt  an  existing  "off  the  shelf"  system  or  to 
construct  a  system  which  synthesizes  elements  of  several  competing 
designs?  Given  the  problems  with  existing  systems  and  the  difficulties  of 
publicly  examining  proprietary  systems,  this  policy  issue  must  loom  large 
in  the  design  of  any  comparison  study. 

For  what  patients  will  physiologic  measures  be  most  useful?  Is  it  possible 
to  focus  on  outliers  or  on  patients  with  unusual  conditions?  A  comparison 
study  designed  to  pilot  the  adoption  of  a  universal  system  will  be  very 
different  in  design  from  one  intended  to  resolve  payment  problems  for 
certain  groups  of  patients. 

What  clinical  data  are  most  useful  for  classification?     Can  a  system  be 
confined  to  rigorously  defined  measures  such  as  temperature  and  serum 
potassium,  or  must  it  use  softer  findings  such  as  "pleural  tightness"  or 
"jugular-venous  distension."    The  difficulties  of  using  subjective  elements 
are  clear,  but  the  feasibility  of  avoiding  them  is  uncertain. 
Studies  now  in  progress  by  Alemi,  Ashcraft,  lezzoni,  and  others  will 
serve  as  pilot  projects  whose  results  will  provide  guidance  on  these  three  issues  as 
well  as  other  aspects  of  study  design.     The  study  itself  will  then  allow  HCFA  to 
make    an    informed    decision    about    incorporating    physiologic    measures   into    the 
payment  system. 
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E.  CASE  MIX  AND  CAPITATION  RATES.  The  problems  of  determining  the  correct 
payment  for  a  hospital  admission  appear  to  have  a  natural  relationship.  Analysis  to 
date  suggests,  however,  that  case  mix  refinement  may  have  much  less  to  contrbute 
to  capitation  rates  than  one  would  hope.  The  problem  is  simply  that  the  acute 
severity  of  illness  and  intensity  of  care  which  contribute  so  much  to  determining 
hospital  costs  are  not  useful  for  describing  a  beneficiary's  expected  use  of  services 
over  the  course  of  a  year.  Rather,  capitated  costs  appear  to  be  determined  by 
chronic  health  status.  For  this  reason,  efforts  to  refine  the  measurement  of 
comorbidities  in  the  DRG  system  may  be  the  most  important  common  theme  in 
refinement  of  DRGs  and  capitation  rates. 

F.  CASE  MIX  MEASUREMENT  AND  MEASURING  THE  QUALITY  OF  CARE.  The 
methods  and  objectives  of  case  mix  measurement  are  central  to  measuring  the 
outcomes  of  care.  The  central  feature  of  such  measurements  is  to  compare 
outcomes  of  care  provided  to  similar  patients  by  different  providers  or  in  different 
settings.  The  key  problem  is  assuring  that  patients  are  indeed  similar;  measuring 
outcomes  presents  some  difficulties,  but  certain  important  outcomes,  such  as  death, 
can  be  ascertained  with  relative  ease  and  great  accuracy. 

The  best  criterion  for  classifying  patients  for  case  mix  measurement  is  the  cost 

of    efficiently    providing    necessary    care,    while    the    best    criterion    for    quality 

measurement  is  the  probability  of  some  outcome  (such  as  death  or  return    to  work  or 

recurrent  hospitalization  or  freedom   from  pain).     Nevertheless,  the  problems  are 

closely  analogous  because,  as  we  have  argued  above,  clinical  homogeneity  of  case 

mix    measurement   groups   is   a   natural   and   efficient   basis   for   such   groups.      The 

emphasis  on  severity  measurement  in  case  mix  measurement  suggests  how  naturally 

clinical  homogeneity  has  become  an  objective  in  case  mix  measurement. 

Available  ineasures  are  in  many  ways  more  applicable  to  quality  of  care  studies 
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than  to  case  inix  measurement  at  this  time  because  studying  the  quality  of  outcomes 
or  just  a  few  conditions  promises  to  be  much  more  useful  than  improving  case  mix 
measurement  for  just  a  few  conditions.  This  is  true  because  there  is  enough  cross- 
subsidization  within  hospitals  to  make  the  cost  of  treating  a  particular  condition  less 
informative  about  the  overall  efficiency  of  the  hospital;  the  problem  does  not  rise 
for  quality  of  outcomes. 

G.  SUMMARY.  The  clearest  conclusion  of  this  chapter  is  that  there  is  no  persuasive 
alternative  to  DRGs  and  no  revolutionary  improvement  is  readily  available.  A  series 
of  modest  improvements  appear  possible,  and  may,  in  combination,  produce  even 
larger  improvements.  Work  to  measure  quality  of  outcomes  is  critically  related  to 
classification  for  payment  because  we  can  only  design  payment  systems  which 
maximize  quality  at  minimum  cost  when  we  better  understand  what  kinds  of 
outcome  are  most  affected  by  cost.  Any  project  which  significantly  improves  DRGs 
as  a  payment  system  is  likely  to  yield  substantial  rewards  for  measuring  quality  of 
care. 


3-^2 


O    0 

CO  a 

00 


B>  - 


0 

w 

I— 


ct  in 
X  o 


(t 

V 

—  0  cr  cfi 

X 

ro  -x)  o  •<  o 

n 

0  a      « 

a 

3 

c  (I  :r  <-< 

T 

o 

►1    "!    c 

l-^ 

3 

r+  u   te   M 

n> 

c+ 

3-      T)  cr 

D 

5" 

1-  c+ 

O 

u 

O         r+  1- 

A 

y      0  3 

g    Percent 

1    Disagreement 

Case 
eck;  Classification 

•0 

a 
»i 

o 
a 

3 


H* 

?; 

JO  3 

OD 

Q) 

tn 

c 

u 

a 

rf 

a  o 


>< 

0 

•0 

»i 

*-s. 

Q) 

0 

m 

CB 

3 

X 

o  c^ 

CD 

CD 

0 

0,   . 

n 

I—I 

01   o 

U 

«< 

c 

(I    w 

c 

a 

OJ    o 

►^ 

OJ 

a> 

n 

►1 

u 

a 

f* 

►^ 

a 

<< 

<i 

v; 

to  CJ 

-0 
CD  •-> 

1-  cz 

«0  M 

CD 

t-  o  > 

3  >-b  cr 

CL  U) 

a   O  r* 

X    3"  I-! 

o  o 

3  r+ 

n  o 

3 

0) 
>— ' 
rt- 

y 


n  >  en 

3"  cr  (I 

•n    01  o 

O   r+  0 

3    »1  3 

►*  c»  a 
o  o 

c+ 

3-  M- 

»  o 

C»    3 


>  tl 

OQ  (I 

•1  "! 

(t  n 

a  a 

3  3 

cr  c+ 

3 


to  H-  cn 
o» »  0 

>«  01    3 

01   ct 

r*  C» 

y  (t- 

B> 

3 


I-*-  O    > 
(-••  --h  CT 

rtr  01 

3  "D    rt- 

ID     3'  ^! 

^"<  n 
>  I-  r+ 

TJ  O    I-* 

cn  ►-  o 

^03 

N3in 

— '  w- 
o 


""0    >  CO 

<^  :r  V  (t 
a  "<  01  o 
3  01   r*  o 

01    H-  ►I    3 

o  tt  a 

>-^  n 

O   rt- 
«   I- 

^  O 

n  3 


O"  rt-  OO   rt> 
01    rti    II    1 


n-  s 

O 
c-t- 

O 

3 


3   3 

(l  rt- 

3 
c-i- 


Z  >  O 

C    01  T 

i-l    01  I— 

01   t--  <-•- 

n  Ob  (i 

01    3  T 

3  >- 

(t)  O 

3  3 
rf 

cr 


-0  t-b  T) 

t--  O  3 

c  II  a 

01  en 

> 

c:  a  33 

X  3  a 

O  >—  CB 

O  01  01 

en  01  0 

►-  3 

o 

3 

0  n  T 
i-i  a  (t 
a  01  f^ 
o;  a  n 

01  tJ 


> 

DC 

a> 

(t 
3 
(I 


> 

l-H    I-* 

o  n-  tfi 

n 

o  o 
a  0 


w  o 


1-  ~i 

to  o 

CD  c 
tn  3 


0  IV3 

p)  CO 

01  - 

ff    -3 

01   to 
o 


z  >  o 

C     W  ►! 

t-l    01  M- 

01    I--  rt- 

01    3  »1 

3  I- 

n  o 

3  3 


cr 


o  c:  'D 

3    13 

>-"a  n 

•<    O  CT 

en 


n  n  T) 

►-^  CO    (II 

a  (fi  f^ 
man 
te       a 

t-         3 


> 
OO 
I"! 

(t 

3 
3 


> 
1-1  i-fc 
O  rt-  to 
O  (t   O 

o 

o  n 
a  0 

3    II 

n  ft 

^  n 

rt- 

H- 
3 
(Ki 


=  a  (T  3 
01  en 


c  > 

z:  a 

D  3 

c  t- 

en  01 

01 

o 

3 


M-  03 

3 

«  o 
n  01 

01  to 

t-f  I- 
ft  t-t- 
►^  ct 

3    I— 
01 
tTJ 

cv 


70  Tl 


o  c:  "0 

3X3 

t-  on 

><  o  en 

en 


0  O  >  *T) 

I— '  t»  oci  ft 

CD  01  ►I  n 

01  ft  ft  n 

W  ft    ft 

t-        3   3 
i-ti        ft   rt- 

V-         3 
0        rf 
a> 
rf 


en 
o 

c 

o 


en 

> 

en 

3 

5 

tr 

•T) 

m 

c- 

m 

o 

m 

en 

o 

JO 

en 

1— ( 

o 

•n 

c: 

H 

PC 

l-H 

n 

o 

m 

z 

^ 

33 

^- 

n 

-^ 

^^ 

•D 

> 

« 

33 

a 

X 

O 

n 

> 

a 

m 

-3 

c: 

o 

t>; 

1— 1 

♦— 1 

\ 

en 

CD 
1— 1 

t- 

n 

c- 

o 

H- 1 

3 

H 

33 

-0 

•< 

> 

> 

H 

33 

»-H 

m 

z 

o 

o 


n 

O  3 

3  m 

t)  > 
>  e/o 

33  c: 

l-H  3) 

en  m 
o 
z  o 


'J) 

o 

c 

H 

0 

a 


n 
a 


CO 
03 


®  o 

(9  C 


tt  - 

CO  u 

(I  CD 

u  o 


o 

D 


u>       a 

c-^  3  00 

O    C 


ff 


c  o  en 

a  o  f+ 

03  01 

0*0  » 

en  c  » 

f* 

a  »  cr 

B) 

►<  S 

f* 

a 

0 

3 

^  0) 

0)    1) 

•C    »    0    TJ 

a  H- 

0   1-  f*  »   (» 

1    H»  3    (» 

^  3 

i-b  3    0) 

►1    1 

M-     -b               HJ 

o»  o 

O  OQ 

(J   o 

3    (H    U    O 

«    0 

O    1-  » 

»  » 

0    0    c*  (t 

3    1 

»    C 

3    3 

M-  c*   0)    3 

0    '^ 

u  a 

«    c+ 

T)         m    rf 

01    (t 

(t  m 

3 

0)    0    (I 

h^  o 

U)    U) 

f+ 

►-   H,            Q, 

M    c+ 

^        '-00 

.  «w 

<  » 

0 

^^  (li    1 

•0 

►-  i-b 

3 

D    X    (T 

n 

C*  -b 

O    •--  (H 

^* 

•y  « 

0    C    3 

3 

o 

►t  a  (J 

n 

c* 

1    »    3 

M- 

(I    ts    c+ 

•o 

O 

a 

< 

00 

a 

o 

H** 

n 

C71 

3-  *< 

c+ 

(J 

a 

X    -1 
(»    CD 


■  (X> 

-  t 

u 

1-  <-*• 

u>  a 

CD  1^ 

tn 

cr  » 


o 
o 
a 

►<  o 

»  o 


w  rr  0) 

cr  o  cr 

u  u  u 

»  c+  a 

o  oj  o 

r+  t-  f+ 

O 

fi  cr 


B> 

3-  u"  a 

cr  0 

<< 

jy 

u 

u 

w 

f+t) 

u> 

c+ 

►^ 

►* 

a 

►1 

» 

f* 

o 

0) 

0 

09 

0 

0 

e* 

t-" 

D 

c* 

0 

a 

n 

w  o 

r+  O 

»  D 

3  c+ 

Q,  p-( 

CD  C» 

*^  n 

a  c+ 

c 

I-b  Bl 

O  1- 
1 


»    0)    2 

X  cr  p] 

•o  «  a 

ff  f+  •-' 

*^  '^  en 

n  S3 

cr      en 
M  cr 

» 
o 

c+ 

o 
*^ 


1  o  a*  n 
0  u  tn  o 

D  H*  1  en 

»  r»-  »  n 

a  tp  o  70 

t-  f+  ti 

en 


<0  »    3    I-b  t*  -TJ 
O    »    0)    1    (» 

n  IS  >-  a  n 

c+  H-  H-  O 

e^  3    » 

a    3 


A  C 

a  >^ 

jr  t»  CD  3 

u  o  yioo 
"< 

X  01 

•O  ** 

<»  rt- 

3 

n 
o 


i 


o  o 

3   3 

a 

H-   3 

(o  a 

00 


a  - 

CR  ife 

A  CD 

N  CO 


o 

CO 

r+ 
0) 


(t  en 
X  o 

•0   -' 

0 
a>  ^ 

c*  3 

cr 


-^  n  cr  en 

**)  o  <<  o 

n  cs  0  Q.      i-H 

3"  C    C    (t  3-  >-i 

(»  cr  "1  1^  o 

o  c«  c+  CO  t«  n 

X  a    3*  "O   Q) 

CO    iD  H-  t+ 

^  C    0>  c*  H* 

»    3  B>   3 

3  a  H-oo 

<-<■ 


O  O  O  Tl 

^  0!    ►-  CD 

D>  CO  en  I-I 

CO  (V  0)  n 

(0  OQ  (» 

K-  H)  3 


M> 

« 

H- 

(* 

0 

3 

0) 

(» 

c+ 

3 

H^ 

c+ 

0 

3 

>  C  en 
<  a  ■ 

ri  in  »( 

»  c+ 

(»    A 

a 


en 

< 

50 

-< 

o 


tr 

z 
m 
en 
en 


z 
o 
m 

X 


o 

3 
ft 

a 


> 

00 


o 
o 

3 

a 


CD 

•     • 

0 

• 

C 

Ui 

-t 

H» 

o 

yg 

» 

»  "0 

►J  ^ 

a  a 

n 

•0 

►t 

O    (t 

a 

C    X 

f+ 

<♦  o 

►-- 

t-"  1—1 

D 

n  c 

a 

(t 

h-*  a> 

c+ 

3    (J> 

r* 

0> 

(t  n 

H- 

1    0 

<  ^1 

a  0 

h^ 

3    3 

(C 

r+  0) 

OD 

►*  1 

<J5 

0  •< 

3 

> 

cn  1 

O   e* 

O    (t 

►t  1 

>-'< 

3 

«    03 

"< 

cnT3 

•<    £» 

01    M 

C*  M 

» 

3   'Jl 

c 

^  "^ 

H« 

►-•  (t 

CD  1 

cn'< 

< 

a 

c 
<t 

0 

H> 
03 

o 

< 

u 

Q) 
U 
U 
I— 
HE) 

3 

a 

c+ 
o 


en 
en 

U) 

o 
o 
<^ 

9 
ca 

OQ 

(t 
» 

a 

f+ 

3* 
0) 

3 

CJ 

en 
o 


H-  o  a: 

CD  1 

01 

OD  0) 

OQ 

Jk-O 

3 

ft 

a 

t 

n 

9 

3 

a 

O  V 

a  CO 
U)   o 


CO  cj 

CO  i-i 
I   o 

CO  u 

CO 

CO 


CD  0>  > 

c  s"*  ns 
>n  H  (»  to 

»    O         "- 

K-   3  tSJ 

n  I- 
0)  n 

I— ' 

=r 
u  A 

C-f    0> 
0)    ^- 

(+  c* 

3" 


H  H  n 

I-I  1    0 

CO  g  c* 

♦  3       : 

•  ft 
w  a 


a  i-i  tn  o  >  > 

O   O  C    3*00   TJ 

tB  c  ^s  II  ft  in 

"O        OQ   O         I-" 

H-  >  t-  3       ro 
cf  a.  o  H- 
»  3  01  n 

to  ft 

a  f*  0) 
X  »  ^ 

C+    ft 

3- 


-9  H  O 

M  "I    O 

cn  H-  01 
en  3  f* 

♦  3 

•  ft 

w  a 


CO  3 

CD  a 

(j\  C 

ft 


(^  C71 
0)  - 
U)    O 
ft    CJ 

u  o 


CO  CJ 

-0 

CO  •— < 
I   o 

CO  01 
CO 

to 


•-H  en  o  >  > 
o  c  yoo  TJ 
G  n  ►I  ft  en 

00  o         H- 

>  ^*  3      to 
a  o  K- 
3  »  n 

01  ft 

a  cf  0) 
X  o>  ►- 

3* 

3: 
o 
►» 

u 


to 

00 

to 

CO 

n 

^  o     — 

»    O           CO 

01    in           C71 

01    1    O  • 

H-  ft    »    W 

Hk  O    01    »< 

M-  f*  ft  w 

ft    ►-  01 

av: 

tn 
o 

c 

n 
m 


tn 


tr 


en 

O 

c 

O 

m 


>  ►-< 

c-  > 

m 


en 


•73 

r- 
pi 

o 
PI 
en 
n 

50 

'D 

o 

z 


n 

50 

o 


^  > 
o  z 
>o 
en  m 
p] 

o 
r  "fl 
PI 

<  > 
Pit) 
r  > 

w  o 

X 

PI 


-9 
> 
CO 

r 
PI 


<  z 

WPl 
•IT) 
>PJ 
CS  Z 

r-o 
Pipj 

z 


SOTJ 

HI  PI 

>z 

too 
r  pj 
piz 


P) 

o 


> 
z 
o 
PJ 


en 

0 

c 

►^ 

-  o 

o 

CC  Q) 

a 

00  ^ 

tri  0 

1 

n 

(t 

<^ 

a 

c+ 

H- 

•s 

(I 

r+ 

Q> 

w 

►-- 

o 

n  o 

»  - 

I-- 

M    h- 

CD 

fs  to 

OD 

M  ro 

O) 

t— » 

•X 

cc 

M- 

OD 

o 

ro  ■y 

H^ 

« 

a 

D 

2 
a 
o 

(5 


-3  C 
1    3 

I—  c+ 

3  ^ 
3   t- 

«  3 

a  3 
a 


CO 
c* 
0) 

n 


o 
o 
u 

r+ 


-J  o 


I-  n  n 
«3  o  o 

03  1-'  Hi 

1^  a  i-ti 

i-b  ft 

»  ■< 


CJ 

Ol 

CO 

> 

o .»» 

■s. 

0)  - 

TJ 

U)   o 

c- 

(t   o 

t=q 

U)   o 

—  3 

<£>•<. 

I  h^ 
OO  01 
^  3 

a 


o)  en 

00      ^  en 

CC  c+ 

O  O  1)  rf 

CO  Q) 

fT"  ►-  i-(    » 

DO 

0)    t^OQ 

OQ    ft 

n  DC)  3  ft 

►^  « 

i-l    3    O    W 

o 

0   O    M- 

c 

C    W  -0 

•0 

•0   •—  0) 

u 

01    W    1— 

tn 
o 

c 

o 
p] 


en 
O 

c 

O 

m 


3: 

r-  o 
r-  > 

m 


z 
o 

-3 


en 

> 

3 

c- 

o 

en 
n 

15 

I— I 
O 

z 


<  z 
>o 
»m 

>M 

ts  z 
c-  o 

z 


<o 

>  M 
33  "TJ 
i-i  P3 
>Z 
B)0 

rr  PI 
n  z 

H 


c 

>50 


M 

s 

> 
Z 
O 

^  P3 

O 

>  O 

en  "q 
pj 

o 
r  '-I 
PI  en 
<  PI 
P5  > 

r-  en 
en 

> 
C3 


O 


> 
00 


gs 


o< 

MPJ 
r*]> 

<P1 


en  CO 

^ 

o 

O 

ae 

W 

?«5 

3 

o 

Cd 

3: 

0    0) 

c 

a 

t-»- 

cu 

(t> 

hJ- 

(D 

M- 

K- 

(0 

a 

C   3 

•'->  M 

—  ^1 

^  Hj 

'-^  C_u 

—  3 

^  a 

.^-vC_fc 

,— N 

£0 

"^  t— ' 

y-N. 

C-t. 

o 

h  V 

03 

o© 

o  ^ 

O  0 

OH- 

O  3 

OO 

ocroM- 

oo 

3 

a  h- 

■   »0 

50  0*  WypO  ►I 

»« 

S)  CD 

SOt^ 

POCK 

»  » 

»^^ 

*0 

cb  a 

03 

o- 

o  o 

o 

O  3 

0'< 

a 

O  c+  O  ►I 

c:^ 

CB 

•  ■  •  • 

0) 

Cfl 

o 

w 

CP 

*< 

» 

o^ 

.-3  ^ 

•-  < 

^^  M- 

^  3*  CJ  "tJ  C*)  OJ 

H-  o 

ro 

Cfl 

l-» 

t-* 

CP 

CP    I-*- 

O  fO 

ODs; 

^  V 

O  M 

CJi  (P 

*«  n 

ro  3 

rf*'  X 

CO 

CO  -3 

I-* 

O 

<    Cfl 

*^  o 

•>-^ 

w   to 

ro 

w    C0 

CD  0 

O  a  CD  (P 

Ji» 

-J  ►< 

o 

. 

H-  O 

(D 

m 

o 

wgO 

d- 

CA 

- 

t-" 

a> 

< 

B)   3 

ri-'O 

3 

\  (?»  C+  CO  C  fi" 

^ 

ff>  o 

ff>  0> 

d- 

H-  (P 

C 

> 

C- 

0 

^^  H 

►fl 

rt- 

u 

H-  0 

5   >1 

o 

(P 

t-- 

c 

CJ  C+ 

(-'  t— » 

H"  1^ 

ro 

H-» 

I-' 

o 

0    H» 

O 

1— • 

X 

o 

3 

J»>  » 

■• 

*.  0 

CO 

CO 

(-» 

• 

3 

a  a 

c 

0 

00 

CO 

CO 

CO  o 

vy 

CD 

t-* 

T) 

c+  3 

u 

"-3 

3^ 

w  O  ro 

>^  • 

s^ 

v_^ 

•.^ 

W  CP 

c+ 

t— 1 

0 

M-  ro 

O  X   ^1 

0) 

> 

t— ' 

Cfl 

^^.^ 

CP  13    O 

>->  M 

• 

• 

/— s 

t) 

h-'  CP 

•      0> 

• 

• 

■ 

• 

. 

• 

^ 

CP 

OJ   3 

3 

o 

o 

o 

o 

o 

o 

o 

3 

H-  d- 

►-•0 

1 

1 

1 

1 

t-* 

1—' 

t-k 

o 

CO 

t— » 

W< 

a  3 

CO  M 

1 

1 

1 

1 

o 

o 

C?i 

o 

CO 

CO 

CJ  0» 

CP 

CP    < 

CD  a> 

CO 

o 

C7) 

o 

CO 

Ui 

Cfl    »^ 

3 

a  0) 

Oi 

h* 

CO 

c+ 

*^ 

0 

07 

d-  JO 

Ms  K- 

H> 

3 
CO  o 

d- 

1 

Cfl 

< 
00 

0    OJ 
^^   3 

3: 

. 

. 

. 

. 

. 

. 

• 

. 

• 

• 

•<    CP 

H-UD 

^ 

O 

CD 

o 

o 

o 

o 

o 

o 

o 

o 

o 

(— » 

Cfl 

CO 

c 

y-^ 

i-*  CP 

a 

t-* 

t-» 

CO 

CO 

CJ 

iCk. 

-^ 

^ 

CD 

CO 

d-  CP 

d- 

» 

a> 

O 

H- 

.(k 

CO 

I-' 

•^ 

CO 

C71 

o 

I-' 

•u 

en 

a>  X 

K- 

^ 

a- 

M 

o 

t-» 

I-* 

1^ 

CO 

CB 

cn 

I-* 

CO 

-0 

ro 

3  'O 

o 

CP 

O 

1—' 

C  X 

H 

0> 

CP  ^ 

Cfl 

a 

0> 

CP 

W 

> 

*^ 

d-  £» 

Cfl 

a  *- 

w 

d 

H-  3 

CP 

CA    CO 

o 

O    CP 

M 

CO  0)    3 

0) 

CO    Q. 

CP 

*< 

3    CP 

CO 

CA 

>*-<' 

< 

3 

a  a 

• 

(P 

CP 

d- 

•«>' 

M 

CO 

h-" 

3-  CO^ 

^«* 

c 

CP 

«<  ?o 

M- 

• 

. 

. 

• 

. 

• 

^ 

3 

> 

d- 

Cfl 

3 

o 

o 

o 

o 

o 

o 

O 

3 

1— 

d-  CO 

1 

1 

1 

1 

o 

o 

k-* 

o 

CO 

H- 

S3> 

0 

(D 

0 

CP  t£J 

►*] 

1 

1 

1 

1 

C71 

-J 

--5 

o 

•(^ 

H» 

a  cr 

H» 

1—' 

3   C 

Mi 

CD 

•^ 

I-' 

o 

C71 

^ 

Cfl    Cfl 

x; 

o 

CP   Q) 

. 

CO  > 

Cfl 

0 

d-  ►^ 

»-* 

d- 

cr 

(-»- 

CO 

^1    CP 

CO 

O  BJ 

Cfl 

Cfl 

c+ 

H-  a 

CD 

CO  (P 

d- 

0 

CO 

O  -' 

•^ 

. 

. 

. 

. 

. 

• 

• 

• 

. 

• 

V!   < 

M- 

I-" 

\ 

Cfl 

o 

o 

o 

o 

o 

o 

o 

o 

o 

o 

Cfl    • 

Cfl 

c 

0 

Q) 

o 

t— ' 

t— ' 

(-* 

^^ 

CO 

.> 

>^ 

-J 

C» 

d- 

d- 

d- 

0> 

>  3 

C31 

ro 

CO 

00 

CO 

CO 

-0 

K* 

CO 

CO 

CP   CP 

H- 

(P 

Cfl 

CL  a 

I-' 

OJ 

>— ' 

-J 

CJ 

CO 

CO 

•^ 

ro 

00 

xplalned) 
metrics 

O 
Cfl 

CP 

< 

£» 
d- 

M- 
O 

3 

CP 

Percent  Absolute 
jacent  DRGs 

01 

o 

c 

n 


n 


CO 

OD 


3    C  iS    CD 

3  a      a 
n  a  01  t 

(*   •     e* 

a      » 
►1 


a  3 


3    c+ 


"^  X  01 

•<  o  a> 
h-  (/I  »- 

CDT3    3 
CD  I—  r+ 

fl)    < 

D 
O 

rti 

3 


•0 

01 

•< 

» 

►1 

u 

H 

m 

T 

X 

01 

0 

3 

*— 

W 

c 

•->> 

a 

» 

Ci 

1 

w 

o 

D    "I 
CD 

r* 


•0    CI 

II    c 

O  ft> 


1— 

« 

03 

CO 

c+ 

1 

OD 

CI 

.tK 

» 

t 

0) 

H- 

(A 

O    01 
O 

cr  -I 

(-►   M 


ffl  - 
ir»  1- 


-^  SC  01 
■<  o  ai 

CO-O    3 
00  M-  r+ 

0)    < 

3 

n 

A 

3 


tJ 

!V 

"< 

(S 

►1 

U) 

-3 

m 

n 

X 

0 

o 

3 

^— 

M 

c 

Htl 

a 

a> 

ft 

fi 

to 

w 

'-  3 

-  3 

oi  m 

ui  m 

o 

o 

(KJ    p-c 

(R    -1 

1    01 

n    01 

c  a 

0  o 

C    PD 

C    ?D 

-0    T) 

•O    -0 

W    01 

w  01 

o 


< 
0) 


o  ro 


O  OJ 


O 
y 

ft  o  >- 

M  0)  - 

<-►  to  ro 

ft  J>. 

•O  CD  CT) 


>-  ft    CD 

<o  f*  *1 

OS         ft 

tn  CD    £ 

cr  ^-  w 

•    f* 

ft 

1 


>■ 
cr 
a  o  •- 

Q)  - 

■O  U  lb 

0?  ft  05 

t—  U  -J 

3 


""I  I  01 

■<  O  CD 
I—  W  I— 
ceo  3 
00  M-  c+ 

»    < 

3 

A 

3 
rf 


M  X 

0)  o 

3  f- 

U  C 

ft  ft 

1-1  M 
U 


>-^ 

3: 

tn  m 

o 

n 

»— t 

►1 

01 

0 

o 

c 

SI 

•0 

13 

W    01 


3 

o 

3 

3    »! 

II 

II    f* 

t;i 

A  01 

00 

U3   H- 

h- 

r* 

■< 

< 

< 

CD 

0) 

h— 

t— ' 

C 

c 

•0    ft 

•O    ft 

II     II 

i1     n 

A  I-* 

/\   ^— 

o  o 

o  w 

•       OJ 

■       OJ 

o  ►- 

O  ro 

»-*  ►— 

►-  -J 

w 

o 
c 
» 
n 


3: 

\  D 
>  ►-I 
r-  o 
c-  > 

m 


\  70 
Z  t- 
03 
-5  3 

o 


01 

> 

3 

m 
o 

01 

o 

►D 

»— t 
O 

z 


<  z 

>  o 
33  m 

>  m 

D3  z 

r-  o 

m  w 

z 


<  O 

>  P3 

33  tl 

HH   PI 

>  z 

03  O 

c-  m 
m  z 


C3 

> 

01  z 

H  z 

>  1 

»— )  I 

01   .-H 

-3  H 

l-H   Z 

o  m 

•< 


na 

m 

33 

■ii 

O 

> 

z 

^  o 

n  m 

> 

01  o 

P3  '^ 

r-3 

-3 

m  t=q 

> 

<  o 

OD 

pj-" 

n 

cr  03 

P3 

-     CD 

33 

CJ 

'*!  tJ 

HH  01 

yi 

33 

01  -< 

-3  Z 

33  T3 

PI  33 

<  PI 

-<  O 

p]  "-. 

C  O 

>^  H 

o 

3 
O 
33 

> 
r- 

H 
•< 


0 

c 
"-( 
o 


a       u) 

H-  (B 

9  3 


Q. 

n 

>-^ 

A 

3 

a 
a 

<-f 
o 

c 
u 


3: 

o 

cr 


"< 

n 

•-h 

(ti 

U 

r* 
0 


c 
►I 

A 

e* 
O 

>l 
(I 
O 
O 
< 

»i 

cr 
v: 

w 

o 

0 

3 

a 

M 
< 


Z  C  «    CO 
M-  3    <-f  1 

3  a      a 

A    01    0)    £ 

O'  r+  HJ  CO 

a      (t 
►1 


X 
•0    c+  O 
jj    h,    o> 

H-  »  w  CD 
3  3  (S  ~J 
^  H*  M    O 


-"iitri 

•<  0 

a> 

>-  Ul 

K- 

ceo 

3 

00  ►- 

f+ 

W  r^ 

01 

< 

»— ' 

K- 

3 

0 

a 

3 

e* 

•0 

0) 

x: 

(f 

►^ 

Ul 

-^  o  m 

►1 

(t 

X 

0) 

0) 

o 

3 

r+ 

^- 

U 

3-  C 

i-b 

CA 

a 

a 

a 

►% 

01 

u 

u 

3 

a 

w  tn 


•2. 
o 

3    1 

II  cr 

>-'  h^ 

o  a 

»-*  ^-- 


•o  en 

II  c 
0) 


-'  »    03 

00  t+  1 

CO         (I 
J^  0)    £ 

01  ^^  01 
•     c+ 

a 
HI 


O    'fi 

o  o  1- 
cr  «^  m  - 
n 

09     0    U    CJ 

3-  w 


M 

(I    00 


•*5 1  w 

-<  O  0) 
>-  in  H^ 
CDT3  3 
CD  M-  (-•■ 

0)   < 

1—1   K- 

3 
O 
<X> 

3 


T) 

o> 

>< 

a 

*^ 

Ul 

"-3  o  m 

1 

(» 

X 

01 

Q) 

o 

3 

t+ 

h-* 

U 

3-  C 

H> 

U 

a 

» 

0 

►1 

0) 

w 

U 

3 

a 

—  3 

o 

OQ     l-H 

0  Cl 
C  S3 
•O    TJ 

w  tn 


o 

3* 

rt  n  — 
01  01  - 
r+  ta  ►- 
f5   00 
"D    «    CD 

0 


CD  f+  •-! 
00  (I 

on  0)    £ 

cr  ^  W 

•     r+ 

»1 


o 

c 

S3 

n 


> 

cn 

cr 

> 

a  a 

H' 

2 

0) 

- 

-0 

"0   w 

Jk. 

r^ 

0)    Oi 

^3 

M 

1—  w 

^ 

3 

-II  I  w 
-<o  a 

>-•    01     K- 

CD-O    3 
CD  K-  c+ 

01    < 

1— '  K- 

3 

n 
(1 

3 


>    H 


en 

o 
c: 

n 
m 


w 

01 

"< 

a 

H 

u> 

Ham 

n 

(t 

X 

a 

01 

o 

3 

r* 

t— ' 

ca 

■y  c 

i-b 

CO 

a 

n 

(» 

11 

0) 

CO 

CO 

3 

a 

3 

m 
\  o 
>  p-i 

cr  > 

S3 


-3 
\  S3 

Z    H-l 

O  3 
H  3 

m 
o 


tn 

> 

3 
•V 

r- 
m 

a 
m 
en 
o 

S3 

HH 

■D 

-3 
I— ( 
O 

z 


^^ 

3 

v\  PI 

o 

OQ 

»-4 

in 

en 

0 

o 

c 

w 

•0 

TJ 

01  tn  H 


3 

3 

0 

0 

3    ►^ 

3 

3    *S 

II  cr 

II 

II  cr 

ro  K- 

►— * 

W  ^~ 

cD  a 

C/l 

o  a 

CJ   H- 

OJ 

.^^  1— 

r+ 

<-•■ 

•< 

h: 

< 

< 

< 

0> 

01 

01 

*— ' 

1—' 

y— 

c 

c 

C 

"0    ft 

•o 

A 

V    Cl 

II    II 

n 

II 

II     II 

/\  ~) 

/v 

.b 

A     -J 

o  *• 

O 

o 

o  o 

itk 

ro 

•    o 

o  cji 

o 

CD 

o  en 

1-  o 

»-• 

•b 

H-     O 

<  z 

>  o 

S3  m 

i-c  TJ 

>  p: 

03  z 

C"  c 

P3  n 

z 


<  O 

>  PI 
S3  tl 
>-H  PI 

>  Z 
03  O 

r-  P5 

P3  Z 
-3 


c:3 
> 
tn  z 
H  z 
>  I 

en  .-I 

-3  -3 

—•  Z 

O  P5 

■< 


P3 
S3 

O 


> 

■~  z 
n  o 
>  PI 
en 

n  o 
'n 
r^ 

5g 

PI  o 

r"  <— 

•    en 

a 

-i]  S3 

*— t  "^ 
S3  en 
en 

-i  HH 
Z 
S3 

P3  13 
<  S3 

«-i  m 

P3  O 

^  n 

-3 
t— * 

Z 

n 

3 
O 
S3 
03 


> 
03 

r" 

P3 


-3 


o 
c 
>^ 
o 
n 


n 


CO 
00 


3  cza  w 

3  a      9 
a  0)  o>  £ 

a  ■    c+ 

a      (t 

1 


w 

X 

"0    (t    W    CD 
0)    3    (»    00 
I—  ►*  U    CJ 
3   r* 
—  << 


"»]  I  cn 

-<  0 

a> 

>-  w 

»-*- 

CDT3 

s 

CO  ►- 

rf 

CJ  f* 

01 

< 

h-^ 

H*- 

3 

O 

a 

D 

o 

OQ 
u 


o 

^^ 

tn 

^ 

Hi  S  H 

O 

ffi 

o 

►1 

M- 

f+ 

OQ 

o> 

(A 

3- 

K- 

C 

(T 

cr  (I 

OCl 

a. 

►1 

o 

0) 

c 

< 

T) 

rt 

U 

to  c^-  "I 
00         A 

a  >-•  w 

■     cf 

(» 


0  3- 

i-t>  o 

cr  f*  n>  - 

1  3  U  H- 
Q  0  (t  OD 
Q)  U  U  Oi) 
C+  01 

3- 


'*]  I  cn 
•<  o  a 

I—  W  K- 
CO-O  3 
00  *--  r* 

0!    < 

H-  ^^ 
3 
O 
A 
3 
r* 


•0 

•o 

CD 

01 

•< 

•< 

(tl 

a> 

1^ 

"1 

M 

u 

H  o  m 

-sow 

►1    A    X 

►1    (t    X 

»   0)   o 

£»    0»    O 

3    !-f  1- 

3    f*  1- 

w  zr  c 

W    3-  C 

Hfc  ta  a 

►^>  CB  a 

»       (» 

A          A 

►I   »   w 

•^    »    CB 

W    3 

W    3 

a 

a 

y  m 

V  m 

o 

o 

«      -H 

«  .-. 

T  05 

1    10 

0  a 

O    C3 

C    33 

C    PO 

•0    "13 

T)    -0 

CD     Ui 

W    CD 

o 

3- 
0> 

'^ 

(A 
A 
(A 


O  N3 

•-b  «  H 

0  A    O 

n  a 

01  3"  t— 

C    c* 

cr  A 
at)  a 

O    0) 

c  < 

•0    A 


O 

3* 

A   O  H- 

CD    B   . 

A    >te 
•O    01    05 


oo  ►- 

O  h- 

H.  s:  H 

O    A    O 

«  rr  •— 
c  c* 
cr  A 
OQ  a 

o  » 
c  < 

•O    A 
09 


A 

C»  11 

CD  c* 

»1  II 

00 

A    II 

A  01 

X    II 

cr 

H- 

01     II 

f*  II 

A    II 
H    II 

> 

cr 

an 

^^   II 

0) 

M 

•0 

01 

1^    II 

a 

A 

O)    11 

H- 

U 

-J    11 

3 

^ 

II 

•»!  as  t/2  II 

•<  0 

tt     II 

h-» 

u 

^—    II 

CO-O 

3     II 

00 

M> 

r+   II 

CJ  (f 

01 

<  11 

►— 

^—   II 
3     II 
O    II 
A    11 
S    11 
r*  11 

>  II 
>-•  11 
^^  II 

•0    II 

a  II 
<<  11 

A    11 
»1    11 
0)    11 

fl 

H  m  II 

T 

X     II 

a> 

O    II 

3 

H^    11 

U 

C     11 

t-h 

a  11 

A 

A    11 

1 

U    11 

01 

cn 

O    H 

OQ 

l-l    II 

1 

CO  11 

0 

C  n 

c 

PO    M 

•0 

TJ  11 

u 

CO  11 

o  11 
y  II 

0     11 

►1  11 
oa  11 

A    II 
U    11 

ro  11 

<j3    II 

►-» 

^^  11 

►^ 

■^  11 

►- 

en  II 

i-b 

«:  H  ii 

0 

A 

O    11 

►« 

H- 

f+  11 

OQ 

ff  II 

u 

3- 

>-•  11 

c 

r* 

C  A 

OQ 

a 

►1 

O 

V 

c 

< 

•0 

w 

A 

to 
O 

c 
X 

o 

M 


\  33 

Z  i-< 

03 

H  3 

M 

O 


CO 

> 

T) 
C" 

pq 

o 

en 
n 

33 
•T3 


P3 

3) 

•»: 

o 

33 

•-  DC 

n  > 

>  z 

H 

CO  O 

> 

m  n 

CD 

ro 

c=q 

P3  rq 

< 

u 

M2 

• 

r-pi 

-J 

-   o 

HH 

•»jcn 

H-n 

3)3) 

H-l 

COT) 

<z 

HW 

Sm 

SJt- 

>-i  T) 

nz 

>  m 

< 

CD  z 

i-fT) 

r  o 

P3  35 

m  PI 

xm 

z 

w  o 

H 

o 

z 

<o 

CI 

>M 

33  13 

f^ 

t-i  m 

X 

>  z 

> 

00  o 

» 

cm 

o 

wz 

n 

H 

w 

£1 

c 

>33 


S 


O 

o 

<5 

>  o 

3)  M 


en 

0 

c 

o 
(I 


o 


a 


CO 


a  o 


sr 


u 
(» 
o 

0 

3 

a  3 
o 

a  c 

<    H* 

>^  a 
"< 


3  a  (t  tD 

I-'   D    Pf  ►! 

3  a  a 

m  Q)  Qg  £ 

o  f*  •—  w 

(I   •  c-f 

a  a 
►1 


X 

c+  a 

»0  (I  en  to 
0)  3  a>  CD 

3  <+ 


•<  o  ss> 

P—    U     M- 

tO'O    3 

00   h^   C* 

BJ    < 

>-•  ►* 

3 

O 

a 

3 


•0 

0) 

<< 

(i 

►s 

u 

H  o  m 

1 

(J 

X 

a 

0) 

o 

3 

f* 

h^ 

M 

rr  c 

Ml 

CO 

c 

m 

(I 

•1 

01 

u 

tA 

3 

a 

o 
o  o 

C    33 
W    CO 


O 
3- 
P) 

9 


to  ^ 
-J  -J 

M.  C  H 
O    «    O 

•1    >-  c* 

n  d> 

C   r* 

cr  (5 
OQ  a 

0  a 
c  < 

•0   (» 
en 


N5 

a  (£> 

C+   00 

w    t  -> 

(»  ^  to 

n  (+  3" 

O   3"  0 

3       i-«  n  I- 

Q.  3    r+  D)   - 

O    3    W    >- 
f^    1    (I    (t    CD 

<     M-    W 

1-  a 
(I   ^*  0 

cr 
•1 
a 

0) 
r>- 

3- 


0 

CD  N5 

»1 

^  OJ 

tn 

S  H 

c 

»    0 

cr 

M-   (-*• 

OD 

OQ    0) 

1 

3-  1- 

O 

c^- 

c 

(» 

T) 

a 

ca 

a 

< 

9 

I—  (I    03 

tC  f+  --I 

OO        a 

J^  »    £ 

£B    ►^  « 

•      c* 

O 

►1 


rt-  05 

(I  H-  "^ 

O  c+  O 

O  3-  3- 
3  (l    O  >- 

a  3  w  »  - 

0    f+  01    >- 
h)    (^  (J    00 

(t  a''o  w  CD 

<  H-  CB 
h*  Q.  h^ 
(J     H-   3 

«    PC  -' 

•< 

-^  3!  U3 

•<  O    » 

►-  M    ►- 

QD-O    3 

CD  M-  ft 

u  rt- 

0)   < 

h-'  I— 

3 

O 

D 


•0 

0) 

v; 

A 

►< 

lA 

H  O  ti5 

►1 

(t 

X 

01 

a 

0 

3 

<r^ 

H^ 

w 

■y  c 

•-b 

ca 

a 

(» 

0) 

1 

V 

in 

cs 

3 

a 

o» 

H- 

-0 

'— 

Hi 

x:  H 

0 

c» 

0 

•n 

!->• 

r* 

n 

Q) 

en 

3- 

h- ' 

c 

c+ 

cr  (t 

OQ 

a 

1 

0 

D> 

c 

< 

•0 

(t 

w 

00         (» 

vat 

cr  i-i  tn 

■    f+ 

a 


r*  O 

u       --• 
(»   S   > 

0  M-  cr 
o  c+  a 

3  3*      o  •- 
a     TJ  0)  - 

3    Qg    U    ^ 

1  O    •—  »    t\3 

(»  1-1  3  a  *. 
<  0-  — 


(t  a 


•»i  I  cn 

-<  O    tti 

I-*    to     K- 

tO'O    3 
CD  M-  c+ 

0    < 

I—'  y^ 
3 
O 
(» 

3 


tt 

>< 

a 

►1 

CA 

-3  o  m 

►1 

a 

X 

0) 

Q) 

o 

3 

c+ 

H- 

« 

3- 

c 

It) 

Ctt 

a 

(t 

rti 

»1 

0 

01 

W 

3 

a 

—  2 

-^  2 

ui  m 

CI  W 

o 

o 

OQ     I-. 

OQ    -1 

►^  en 

■t  cn 

0  a 

o  o 

C   33 

C    33 

T)    -0 

•0  nj 

w  en 

ui  en 

n 

3- 
0) 

OQ 

01 


en 
o 

c 

33 
O 

m 


en 

> 

s 

T3 

n 

pq 

en 

O 

c 

33 

O 

W 

3 

W 

^  O 

>  >-i 

r"  o 

n  > 

33 

t=q 

H 

^  33 

Z  1-1 

03 

H3: 

m 

o 

- 

H- 

00  ,1^ 

ui  Oi 

H,  x:  -3 

0  m  0 

►1    H-  (^ 

n  <» 

W  3-  ►- 

C   c* 

cr  01 

OQ  a 

*\ 

o  » 

c  < 

•0   » 

w 

en 
> 
3: 

t) 
c- 
t=q 

o 
m 
en 
o 

33 

I— I 

•T3 

o 

z 


<  z 

>  o 

S3  P3 
i-TI 

>  m 

DS  Z 

r-  o 
m  PI 


w 


IS 
33 

O 


> 
'—  Z 

o  o 
>  m 
en 

M  o 
11 
r- 
m  3 

<  m 
tn  o 
r  i-H 
-  en 
tn  o 
m  33 
n  13 
O  en 
z 

O   l-H 

z 
w 
P3  n3 

<  33 
I— I  P3 
P3  O 
S  i-i 

w  a 

I— ( 
z 
o 

o 

> 

33 
M 

en 


> 

03 

r- 

P3 


en 
o 

c 
1 
o 


n 


05 


►-  -< 

CD  O 
(X  C 

y  3 

OQ 


CD 

a,  - 

n  fo 


T)  *: 

n 

(t 

►— 

D 

U 

to 

3 

(* 

CD 

u 

(D 

w< 

1^ 

h-- 

D 

< 

a 

3 

3 
3 

a 


c 

3 
f+ 
►1 

^*- 
3 
3 

a 


'TJ>  CD 

'V 

3  to   1- 

3 

n  01  3 

o 

W    M-Oq 

CO 

OQ    t- 

3    (I 

3 

(» 

3 

f+ 

o 
en 


y 

yi 

--1 

-J 

05 

to  O) 

CO 

«  G  C  C 

« 

3 

a 

3 

M- 

£ 

H- 

t 

n 

9 

OQ 

a 

y 

^- 

3- 

►— 

c+» 

r^OQ 

a 

3-  a 

3- 

a  c* 

a  f* 

A 

» 

cr  a  cr  a 

«< 

«< 

•0 

•0 

f* 

f* 

« 

u 

CO  a 

OB  I— ' 
(j\  o 

9 


■  CJ 

o 
n  o 
a  - 

M  ^ 
»  N3 
CD    N3 


CO    k* 

N3  rr 


a 
o 

0! 
9 


H  C 
►1  3 
H*  r+ 
3  ►! 
3  >- 
A   3 

as 
a 


o 

0 
(A 
c+ 
U 


N3  >-• 
03  Cjn 


cn 
o 

c: 

n 


cn 

> 


cn 

O 

c 

cn 

33 

> 

n 

s 

m 

TJ 

n 

n 

o 

m 

cn 

n 

33 

1-^ 

§ 

\  o 

HH 

>  "-1 

o 

c-  o 

z 

r*  > 

g 


\  3) 
Z  1-1 
O  3 

PJ 
O 


<z 

>  o 
3)  m 

>  m 

CD  Z 

r-  o 
m  m 

z 


n 

3J 

•«j 
o 

> 
z 
o 
n 

o 

n  > 
>  H 

cn  t-i 
P3  m 
z 
n  H 
m 

<2 

pq  > 

r  z 

w    > 

PI 

3 

n 


> 

a 
o 

3> 

n 

CO 


-3 
> 
CD 


Table  3.10 

CCMPAlISCN  of  the  VARI7VnCI«J  (R-SQUARE)  TN   COST  OF  INDIVIDUAL  CASES 
EXPLAINED  BY  THREE  CASE  MIX  MEASURES 


Diagnosis- 
Related 
Groups 


Patient 
Managonent 
Categories 


Disease 
Staging 


Trijimed 


0.30 


0.26 


0.17 


Untrimned 


0.17 


0.15 


0.10 


*  Cost  data  are  untrlrmed  and  exclude  DRGs  468  and  470. 
Source:  Galore,  unpublished  data  1985. 
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TABLE  3.13 

EFFKCT  ON  HOSPITAL  PAYMENT  OF  USING  SYSTEMETRICS  REFINEMENTS 

INSTEAD  OF  DRGs 
For  Seventeen  DRGs,  by  Hospital  Characteristics 


Hospital  Characteristic 


Ownership   (Omitted  class: 

Proprietary 

Public 

Size   (Omitted  class: 

Small  City 

Medium  City 

Large  City 
2nd  quarter 
3rd  quarter 
4th  quarter 


City 


PFS 
PP3 
PPS 


Voluntary) 


Rural) 


Trauma  center 

Number  of  beds  (change  per  bed) 
Resident/bed  ratio 

Proportion  of  Medicaid  Admissions 
Inner  City 

Interaction  of  Prop,  of  Medicaid 
Admissions  and  Inner  City 

New  England 

Mid  Atlantic 

Florida 

Illinois /Michigan 

Indiana/Ohio /Wis con . 

Mississippi 

Kentucky/Tenn . /Alabama 

Neb./N.  Dak./S.  Dak. 

lowa/Kansas/Minn. /Missouri 

Texas 

Ark. /La . /Oklahoma 

Utah 

Ari".  /N.  Mex./Nev. 

Idaho/Montana/Colorado/Wyom . 

California 

Alaska/Hawaii 

Oregon/Washington 

Estimated  datal 
Estimated  data2 
Intercept 
R  Square 


Change  in  Payment 

Per  Case  ($) 

18, 

,00 

15. 

00 

0. 

00 

-10. 

.00 

-3. 

00 

24. 

,00* 

-14. 

00 

-30. 

.00* 

-56. 

00* 

0, 

.  07** 

341. 

00** 

98, 

,00 

45. 

00* 

-216, 

.00 

92 

.00** 

52, 

,00 

29, 

.00 

14, 

,00 

-9 

.00 

-30, 

,00 

-12 

.00 

7, 

.00 

46 

.00* 

43, 

.00* 

4 

.00 

109 

.00* 

50 

.00 

34 

.00 

21 

.00 

407 

.00** 

78 

.00** 

15 

.00 

71 

.00** 

-68 

.00** 

0 

.054 

*P<.05. 
**P^"  .01 


Notii   Results  based  on  provider  level  models  weighted  by  number  of 

bills.   See  Table  3,4  for  a  list  of  ADRGs  included. 
Sou  ■,.-.> 0-.  Cretin  et  al  .  19S6 


Table  3.14 

RELATIVE  CHARGES  FOR  BENEFICIARIES  WITH  EIGHT  CHARACTERISTICS 

Coirpared  to  Otherwise  Similar  Beneficiaries 

Controlling  for  DRG  and  Hospital  in  which  Care  was  Rendered 

5  Percent  Sanple  of  1983  Medicare  Discharges 


Beneficiary 
Characteristic 


Relative 
Charge 


1.  Nursing  Home 
Resident. 


1.062*** 


2.  Hospitalization 
in  Previous  Year. 


1.033*** 


3.  Medicaid 
Enrol lee 


0.985** 


4.  Aged  079). 


1.038*** 


5.  Very  aged  084) 


0.987* 


6.  Younger 

disabled  <65) 


0.993 


7.  Younger  endstage 
renal  disease. 


1.052** 


8.  Older  disabled. 
064) 


1.067*** 


*  =  p  <.05  that  ratio  is  =  1.00;  **  =  p  <.01  that  ratio  is  =  1.00; 
***  =  p  <.00l  that  ratio  is  =  1.00,  using  t  test. 

■'■  Based  on  10  random  sanples  of  60  DRGs  each. 


Source:  Jencks  &  Kay  1986, 


CHAPTER  4 
HOW  DO  COSTS  OF  TREATMENT  DIFFER  BETWEEN  HOSPITALS  AND  WHY? 

The  last  two  chapters  examined  case  mix  adjustment  at  the  individual  case 
level.  We  concluded  that  some  increase  in  precision  of  case  classification  is 
possible,  but  that  no  system  is  currently  available  to  either  replace  DRGs  or  to 
provide  a  strong  severity  adjustment  to  the  DRG  system.  We  suggested  that  work 
already  in  progress  promises  to  improve  case  mix  measurement  as  it  applies  to 
nonoutlier  cases  and  that  severity  measures  may  do  more  for  our  understanding  of 
outlier  cases  than  of  typical  cases.  But  we  also  pointed  out  that  the  severity  model 
must  be  supplemented  by  an  understanding  of  the  issues  of  intensity  of  care  if  we 
are  to  understand  differences  in  cost  per  case. 

Chapter  1  pointed  out  that  hospital-level  measures  are  an  important 
complement  to  case  mix  adjustment  at  the  individual  case  level.  In  this  chapter  we 
examine  ways  in  which  hospital-level  adjustments  may  be  able  to  achieve 
refinements  which  are  impractical  at  the  individual  case  level.  Section  I  describes 
the  hospital-level  adjustments  which  were  included  in  PPS  and  why  they  are 
constructed  as  they  are;  Section  II  provides  a  model  of  why  hospitals  have  different 
case  mixes;  Section  III  examines  the  empirical  evidence  on  hospital  differences; 
Section  IV  looks  at  the  empirical  problems  involved  in  setting  prices  which  will  be 
used  across  hospitals  and  how  these  issues  influence  our  ability  to  compare  hospitals; 
Section  V  analyzes  information  on  the  winning  and  losing  hospitals  during  the  first 
year  of  PPS;  Section  VI  provides  a  parallel  examination  of  the  impact  of  the  outlier 
payment  system  in  the  first  year  of  PPS;  Section  VII  discusses  new  ways  in  which 
hospital-level  adjustments  might  be  made;  Section  VIII  examines  the  evidence  that 
larger  hospitals  have  higher  reasonable  costs  than  smaller  hospitals;  and  Section  IX 
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summarizes  evidence  as  to  whether  the  current  case  mix  adjustment  system  is 
sufficiently  precise  to  support  national  payment  rates  without  a  hospital-specific 
component. 

1.  THE  PPS  MODEL  OF  INTERHOSPITAL  CASE  MIX  DIFFERENCES 
In  Chapter  1  we  described  the  way  in  which  PPS  pays  hospitals.  PPS  procedures 
can  be  restated  as  a  theory  or  model  of  how  costs  should  differ  across  hospitals. 
PPS  implies  a  model  in  which  five  variables  (labor  costs,  teaching,  urban/rural 
location,  case  mix,  and  outlier  cases)  should  determine  hospital  costs  and,  therefore, 
will  determine  Medicare  payments.  In  addition,  the  model  implied  that  actual  costs 
and  geographic  region  were  to  be  considered  only  to  cushion  the  shock  of  transition 
and  were  not  allowable  sources  of  difference  in  the  long  run.  In  theory,  these 
variables  determine  payment  through  a  purely  mathematical  and  objective  analysis 
of  recent  hospital  costs.  In  fact,  a  number  of  judgments  were  necessary  to  shape 
the  payment  formula.  Examining  these  variables  and  the  judgments  which  were 
made  clarifies  how  an  implicit  model  of  hospital  differences  results  in  a  particular 
payment  policy: 

A.  URBAN/RURAL  LOCATION.  The  urban/rural  adjustment  is  based  on  the 
empirical  observation  that  per-case  costs,  after  adjustment  for  wages,  CMI, 
teaching  intensity,  and  bedsize,  are  substantially  lower  in  rural  areas.  Although  this 
difference  might  reflect  lower  costs  of  doing  business  which  have  not  yet  been 
identified,  more  likely  causes  are  use  of  simpler  technology  and  treatment  methods 
and  referral  of  more  difficult  and  costly  cases  to  urban  medical  centers.  The 
designers  of  PPS  made  several  decisions  in  addressing  this  problem: 

1.  They  decided  to  treat  location  as  a  dichotomous  variable  rather  than  as  a 


1+-2 


continuous  variable  (based,  for  example,  on  population  of  the  MSA  in  which  the 
hospital  was  located).  This  decision  was  made  despite  the  fact  that  the  econometric 
equations  on  which  the  calculations  were  based  used  a  four-level  measure  of 
location.  The  most  important  reason  for  this  decision  was  the  complexity  of 
maintaining  multiple  levels  and  the  evidence  that  no  simple  continuous  variable 
would  work  well.  But  even  if  the  underlying  typology  of  urban  and  rural  hospitals  is 
appropriate  for  payment,  there  is  very  likely  to  be  a  continuum  between  the  two 
types.  Since  the  standard  payment  amount  is  higher  in  urban  areas,  some  hospitals 
classified  as  urban  may  be  paid  too  much  because  they  are  on  the  border  of  being 
rural  and  some  which  are  classified  as  rural  will  be  paid  too  little  because  they  are 
on  the  border  of  being  urban  (Table  4.1A).  This  boundary  problem  is  one  of  the  more 
difficult  technical  issues  in  PPS. 

2.  A  number  of  small  but  relatively  sophisticated  rural  hospitals  pointed  out 
that  they  would  do  very  poorly  under  this  dichotomy,  so  Congress  established  the 
concept  of  regional  referral  centers,  which  would  be  paid  at  urban  rates  despite 
their  location  (about  160  hospitals  are  so  designated). 

3.  On  the  other  extreme,  a  number  of  small  rural  hospitals  argued  that  they  did 
not  have  the  flexibility  to  adapt  to  national  rates,  so  Congress  established  the 
concept  of  sole  community  hospitals,  which  would  retain  75  percent  hospital- 
specific  payment  and  would  not  be  moved  to  national  rates  (about  360  hospitals  are 
designated  as  sole  community  hospitals. 

In  addition  to  further  discussion  of  this  issue  in  this  report,  the  Department  will 
deliver  to  Congress  a  separate,  comprehensive  report  on  the  urban-rural  issue. 
B.  LABOR  COSTS.    Labor  costs  are  measured  by  a  direct  survey  of  wages.    We  focus 
here  on  the   the   calculation   made   to  determine   the  relation   of  the   area   wage 
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level  to  hospital  costs,  and  hence  to  Medicare's  payment  rate. 

Table  4.1  A  shows  that,  controlling  for  case  mix  index,  hospital  size,  location,  and 
teaching,  hospital  costs  rise  0.84-1.02  percent  for  every  1.0  percent  increase  in  the 
area  wage  level.  The  architects  of  PPS  calculated  that  local  wages  influence  79.15 
percent  of  hospital  costs  (about  60  percent  in  direct  labor  costs  and  about  20 
percent  more  in  purchase  of  local  services  which  are  influenced  by  local  labor  costs) 
and  decided  that  79.15  percent  of  PPS  payments  would  be  adjusted  for  local  labor 
costs.  This  meant  a  0.79  percent  rise  for  every  1.0  percent  increase  in  the  area 
wage  level. 

The  reason  that  cost  per  case  varies  more  with  the  area  wage  level  than  the 
calculated  79.15  percent  is  that  the  area  wage  level  is  also  a  proxy  for  other  factors 
associated  with  higher  hospital  costs  which  are  not  measured  directly.  Those 
factors  probably  include  additional  operating  expenses  related  to  urban  location  and 
an  association  of  high  labor  costs  with  a  more  intense  style  of  medical  practice.  A 
clear  example  of  the  way  in  which  the  wage  index  can  serve  as  a  proxy  for  other 
factors  can  be  seen  by  comparing  Table  4. IF  with  Table  4. IB.  Adding  urban/rural 
location  to  the  equation  decreases  the  apparent  relation  between  cost  per  case  and 
wage  index.  When  urban/rural  location  is  not  considered,  the  cost  of  a  case 
increases  1.43  percent  for  every  1.0  percent  increase  in  the  wage  index  because  the 
lower  wages  in  rural  areas  are  also  serving  as  a  proxy  for  the  less  expensive  practice 
patterns  in  rural  areas.  When  the  relation  of  cost  to  wage  index  is  adjusted  for 
differences  between  urban  and  rural  location,  the  cost  of  a  case  increases  only  0.98 
percent  for  every  1.0  percent  increase  in  the  wage  index. 

The  designers  of  PPS  chose,  based  on  evidence  as  to  what  the  effect  of  wages 
on  cost  per  case  should  be,  to  regard  the  remaining  effect  as  a  kind  of  variation  for 
which  Medicare  should  not  pay.     They  did  not  choose  to  use  the  wage  index  as  a 
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proxy  for  unmeasured  case  mix  variation.  The  decision  to  apply  the  local  wage 
index  to  79.15  percent  of  hospital  costs  was  intended  to  reflect  the  actual  way 
hospitals  purchase  labor  and  labor-related  services,  and  to  avoid  a  perverse 
incentive  to  hospitals  to  raise  their  wages.  A  secondary  effect,  however,  was  to 
make  PPS  a  somewhat  less  exact  model  of  actual  differences  between  hospitals. 
C.  TEACHING  ACTIVITY.  Teaching  activity  is  measured  by  the  ratio  of  the  total 
number  of  interns  and  residents  to  the  total  number  of  beds  in  a  hospital.  But  the 
way  that  this  measurement  is  used  to  determine  payment  again  reflects  judgment. 
When  the  architects  of  PPS  calculated  the  effect  of  teaching  on  costs  they 
simultaneously  calculated  the  effects  of  hospital  size  (measured  by  total  beds). 
When  they  controlled  for  hospital  size,  the  effect  of  teaching  on  cost  per  case  was 
much  lower  than  when  hospital  size  was  omitted  from  the  calculations.  That  is, 
some  of  the  higher  cost  seen  in  large  teaching  hospitals  was  also  seen  in  large  non 
teaching  hospitals. 

The  architects  of  PPS  decided  for  a  variety  of  reasons  that  the  tendency  of 
increasing  hospital  size  to  increase  cost  per  admission  was  probably  due  to 
inefficiency;  there  was  little  empirical  evidence  either  to  support  or  to  refute  their 
decision.  Because  they  did  not  wish  to  encourage  large  size,  they  controlled  for 
hospital  size  in  calculating  the  indirect  cost  of  teaching  but  excluded  size  from  the 
payment  system.  They  therefore  adopted  the  measure  of  teaching  effect  which  had 
been  calculated  after  controlling  for  the  effect  of  size—about  a  5.8  percent  increase 
in  cost  per  case  for  every  tenth  of  an  intern/resident  per  bed  (Table  4.1A).  Had  they 
excluded  bedsize  from  the  calculation  of  the  teaching  adjustment,  they  would  have 
found  an  8.5  percent  increase  in  cost  for  every  tenth  of  an  intern/resident  per  bed 
(Table  4. IB).  Although  the  magnitude  of  these  adjustments  is  somewhat  different  in 
calculations  for  1984,   the  relationship  is  similar.     Teaching   hospitals   argued   to 

4-5 


Congress  that  they  would  lose  money  under  the  5.8  percent  adjustment;  Congress 
responded  by  doubling  the  initially  proposed  teaching  adjustment  to  11.6  percent. 
Thus  Congress  overcompensated  for  the  decrease  which  had  resulted  from 
controlling  for  the  effect  of  hospital  size. 

The  designers  of  PPS  did  not  attempt  to  determine  whether  the  increased  costs 
in  teaching  hospitals  related  to  the  cost  of  teaching,  the  cost  of  caring  for  sicker 
patients,  the  cost  of  more  intensive  treatment,  or  inefficiency.  Through  a  contract 
from  the  Office  of  the  Assistant  Secretary  for  Planning  and  Evaluation,  the 
Department  has  just  completed  the  Graduate  Medical  Education  study,  which 
examines  the  sources  of  these  cost  differences.  Analyses  are  just  being  completed 
and  are  being  published  separately. 

There  is  also  evidence  that  the  indirect  teaching  adjustment  has  a  more 
complex  relation  to  the  number  of  interns  and  residents  per  bed  than  the  simple 
percentage  increase.  The  indirect  teaching  adjustment  actually  is  somewhat  too 
high  for  hospitals  with  high  teaching  activity  compared  to  those  with  low  activity. 
For  example,  Cretin  et  al.  (1986)  show  that,  at  national  rates,  profitability  of 
teaching  hospitals  rises  twice  as  rapidly  with  increased  teaching  for  hospitals  with 
0.25-0.'*9  residents  per  bed  as  for  hospitals  with  0.01-0.2^  residents  per  bed  (see  also 
Sheingold  1985  and  Cotterill  et  al.,  1985  for  comparable  results  from  other  data).  In 
COBRA,  Congress  legislated  changes  in  the  teaching  adjustment  which  bring 
payments  slightly  closer  to  this  nonlinear  relationship. 

D.  ORG  MEASUREMENT  OF  CASE  MIX.  In  Section  IV  of  this  chapter  we  discuss 
the  evidence  that  DRG  weights  are  "compressed"~that  is,  weights  for  complex 
cases  are  too  low  and  weights  for  simple  cases  are  too  high.  If  weights  aire 
compressed,   a   hospital's   average   cost   per   case    will   rise    faster   than   its   CMI. 
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Whether  weights  appear  compressed  depends,  however,  on  the  way  in  which  hospital 
costs  are  predicted  from  independent  variables.  Tables  4.1  A  and  4.1B  show  that, 
again,  whether  weights  appear  compressed  depends  on  whether  the  calculation 
includes  hospital  size.  If  the  equation  includes  residentsA>ed,  wage  index,  CMI,  and 
urban/rural  differences,  then  cost  per  discharge  goes  up  about  12.8-14.6  percent  for 
every  10  percent  increase  in  the  CMI,  indicating  compression.  But  when  hospital 
size  is  included,  the  increase  in  cost  is  almost  exactly  10  percent  for  every  10 
percent  increase  in  the  CMI.  Thus,  including  the  effect  of  hospital  size  when 
estimating  the  payment  corrections  and  excluding  it  from  payment  because  of  a 
suspected  relationship  to  inefficiency  also  solved  the  apparent  problem  of  DRG 
weight  compression.  We  will  return  to  the  hospital  size  issue  in  Section  VII. 
E.  OUTLIER  POLICY.  At  first  examination,  outlier  policy  was  based  on  a  wholly 
rational  economic  decision  that  the  outlier  system  would  pay  only  the  marginal  costs 
of  additional  care  and  by  using  the  best  available  estimates  of  such  costs.  More 
careful  examination  makes  clear  that  the  apparently  mathematical  and  economic 
decisions  were  shaped  by  policy  judgments  as  much  as  by  numbers,  as  two  examples 
will  illustrate: 

1.  Marginal  Costs  for  Cost  Outliers.  In  implementing  the  PPS  law,  marginal 
cost  is  defined  as  the  additional  costs  of  care  beyond  the  outlier  threshold.  There  is 
no  evidence  that  the  additional  cost  of  services  provided  to  cost  outliers  beyond  the 
threshold  is  different  from  the  cost  of  services  provided  to  these  cases  before  the 
threshold  is  reached.  Data  available  when  implementing  regulations  were  written 
suggested  that  marginal  costs  for  day  outliers  were  probably  lower  because  there 
were  substantial  fixed  costs  associated  with  extra  days  of  stay,  and  this  evidence 
was  extrapolated  to  cost  outliers.  Section  VI  discusses  current  evidence  on  the 
marginal  cost  of  day  and  cost  outliers. 
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2.  Limiting  Outlier  Payments  to  the  Federal  Share.  The  architects  of  PPS 
decided  to  apply  outlier  payments  only  to  the  Federal  portion  of  the  payment  (25 
percent  in  the  first  year)  and  not  to  the  hospital-specific  portion  (75  percent  in  the 
first  year).  To  the  extent  that  outlier  policy  was  intended  to  protect  expensive 
hospitals  with  a  disproportionate  share  of  outliers,  this  was  logical  because  the 
hospital-specific  portion  of  the  payment  rates  incorporated  the  costs  which  the 
hospital  incurred  from  having  a  large  number  of  outliers  in  the  base  year.  But 
outlier  policy  was  also  intended  to  provide  insurance  against  the  risk  of  occasional 
very  expensive  cases  for  smaller  hospitals  with  average  costs.  The  small  hospital 
needs  this  insurance  because  it  does  not  have  enough  discharge  to  "average  out" 
exceptional  cases.  The  fixed  hospital-specific  portion  of  the  payment  causes  as 
much  risk  as  the  fixed  Federal  portion,  but  the  insurance  applied  only  to  the  latter. 

These  two  decisions  on  outlier  payment  were  shaped  by  a  common  policy  goal, 
which  was  to  assure  that  while  the  outlier  payment  system  reduced  the  risk 
associated  with  outliers  it  would  under  no  circumstances  make  outliers  fiscally 
attractive. 

F.  DISPROPORTIONATE  SHARE  OF  MEDICARE,  MEDICAID,  AND  LOW  INCOME 
PATIENTS.  The  original  PPS  legislation  authorized  the  Secretary  to  find  a  way  to 
make  special  adjustments  in  payments  to  hospitals  with  a  disproportionate  share  of 
Medicare,  Medicaid,  and  other  low  income  patients.  Recent  legislation  (COBRA) 
establishes  specific  formulas  for  increasing  payments  which  are  based  on  the 
proportion  of  a  hospital's  Medicare  patients  who  are  eligible  for  Supplemental 
Security  Income  payments  and  the  proportion  of  non-Medicare  patients  who  are 
eligible  for  Medicaid.  This  adjustment  does  not  necessarily  rest  on  needs  of 
individual  patients,  but  to  the  extent  that  PPS  is  built  on  a  case  mix  model, 
Congress   thus   implicitly   accepted  poverty   as  a   legitimate   proxy    for   case    mix 
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variation  which  was  unmeasured  by  the  CMI  and  the  outlier  adjustment.  HCFA  is 
examining  whether  this  difference  is  actually  observable  under  PPS. 
G.  REGIONAL  ADJUSTMENTS.  Regional  differences  in  hospital  treatment  patterns 
are  well  documented  and  remain  a  perplexing  problem  for  Medicare  policy.  The 
assumption  in  the  law  is  that,  in  the  long  run,  they  need  not  be  recognized— perhaps 
because  they  are  thought  to  result  from  unnecessary  regional  variations  in 
efficiency  or  treatment  practices;  the  regional  component  of  payment  rates  is 
scheduled  to  phase  out  by  1989  (see  Table  1.1).  These  differences  are  not  at  all  well 
understood  and  might  result  from  differences  in  patterns  of  illness  or  hospitalization 
which  should  be  considered  in  setting  payment  policy;  there  are,  however,  no  data  to 
support  such  explanation. 

H.  CONCLUSIONS.  This  analysis  shows  how  apparently  mathematical  decisions 
required  policy  choices,  which  often  produced  payment  rules  which  were  intended  to 
be  efficient  rather  than  those  which  would  most  closely  reflect  current  medical 
practice.  Thus,  significant  reallocations  of  Medicare  payments  among  types  of 
hospitals,  among  regions  of  the  country,  and  between  typical  and  atypical  patients 
were  built  into  PPS  by  its  designers.  Although  the  circumstances  which  shaped 
those  decisions  may  have  changed  somewhat  in  3  years,  and  though  we  now  have 
more  data  with  which  to  assess  whether  they  should  be  continued,  it  is  important  to 
keep  in  mind  that  the  goal  of  PPS  was  neither  to  perfectly  duplicate  existing 
payment  patterns  nor  to  pay  costs.  The  architects  of  PPS,  both  in  the 
Administration  and  in  Congress,  clearly  intended  to  go  beyond  those  standards  and 
to  pay  what  they  judged  to  be  the  costs  of  delivering  necessary  care  in  hospitals  of 
average  efficiency. 
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II.  HOW  PATIENTS  WITH  HIGH  TREATMENT  NEEDS  CONCENTRATE  IN  SOME  HOSPITALS 

At  this  point,  we  change  focus  slightly  and  consider  how  interhospital 
differences  in  case  mix  actually  occur.  Since  urban/rural  location  and  teaching 
activity  are  only  proxies  for  patient  needs,  we  need  a  model  of  how  patients  with 
greater  needs  might  concentrate  in  some  hospitals.  If  all  hospitals  received  a 
random  selection  of  patients,  we  would  expect  only  random  or  nonsystematic 
variation  in  case  mix  among  hospitals.  Different  hospitals  might  thus  show  very 
similar  patterns  of  variation  within  the  uncomplicated  myocardial  infarction  DRG  if 
all  patients  with  myocardial  infarction  were  simply  taken  to  the  physically  nearest 
hospital. 

However,  different  hospitals  appear  to  have  patients  with  different  average 
care  needs,  and  these  differences  appear  to  result  from  differences  between 
hospitals  along  three  dimensions: 

—who  lives  in  the  hospital's  catchment  area, 

--how  patients  are  referred  to  different  hospitals,  and 

—how  hospitals  define  the  needs  of  patients. 

A.  CATCHMENT  AREA  FACTORS.  An  obvious  source  of  difference  between 
hospitals  is  differences  between  catchment  areas  which  they  serve.  Thus,  a  hospital 
serving  a  catchment  area  with  an  excess  of  the  very  poor,  nursing  home  residents,  or 
alcoholics  is  likely  to  have  a  patient  population  with  different  needs  from  a  hospital 
with  an  average  catchment  area  population.  Measuring  these  differences  is 
difficult,  and  we  are  not  aware  of  research,  either  complete  or  in  progress,  which 
would  permit  relating  catchment  area  factors  to  hospital  level  of  severity. 

B.  REFERRAL  PATTERNS.  Hospitals  acquire  a  particular  mix  6f  patients,  which 
differs  from  that  of  other  hospitals  with  similar  catchment  areas,  through  a  variety 
of  referral  systems. 
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1.  Reasons   for    Referrals.      The   reasons   for   referrals   to   particular   hospitals 
include  hospital  characteristics  such  as: 

—availability  of  diagnostic  or  therapeutic  services, 

— perceived  superior  or  inferior  services, 

—perceived  willingness  or  unwillingness  to  accept  patients  in  certain  nnedical  or 

financial  conditions,  and 
—desire     to     take     advantage     of    or     avoid     a     style    of    practice     which     is 

characteristic  of  a  hospital. 

2.  Mechanisms  for  Referrals.  The  mechanisms  of  referral  also  vary  widely  and 
include: 

—hospital-to-hospital  transfers; 

—emergency  room  transfers; 

—selection  of  hospitals  by  physicians,  especially  physicians  with  privileges  at 
more  than  one  hospital  who  admit  selectively  or  differentially; 

—referral  agreements  with  institutions  such  as  nursing  homes  and  community 
health  centers; 

—ambulance  runs,  emergency  medical  service  networks,  and  similar  transport 
systems;  and 

— self-referral  by  patients. 

Such  referral  mechanisms  produce  concentrations  of  different  kinds  of  patients 
in  different  hospitals.  To  the  extent  that  such  concentrations  are  based  on  clinical 
judgment  by  physicians,  classification  systems  are  unlikely  to  capture  the  impact  on 
hospital  case  mix  because  clinical  referrals  are  likely  to  be  made  for  more  subtle 
reasons  than  available  patient  data  can  capture. 

C.  HOSPITAL  DIFFERENCES  AND  DIFFERING  DEFINITIONS  OF  PATIENT  NEEDS. 
The  increased  cost  per  case  observed   in   teaching  hospitals  is  often  attributed  to 
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patients  being  sicker  or  needing  more  intense  care.  In  Chapter  2,  however,  we 
pointed  out  the  limitations  of  the  "severity"  model  of  patient  needs.  We  begin  here 
to  develop  a  more  complex  model  of  hospital  differences  which  builds  on  our 
knowledge  of  the  limits  of  the  severity  model  and  which  appears  to  take  better 
account  of  known  facts.  To  illustrate  this  model,  we  will  describe  a  hypothesis  as  to 
how  patients  differ  between  major  teaching  and  community  hospitals. 

Major  teaching  hospitals  can  be  described,  for  this  hypothesis,  as  having  two 
kinds  of  care. 

1.  Specialty  Units.  This  care  is  concentrated  in  highly  specialized,  acute 
referral  units,  such  a  trauma,  burn,  neonatal  intensive  care,  and  immune  deficiency 
services.  In  these  units,  which  represent  a  small  fraction  of  teaching  hospital 
admissions,  severity  of  illness  is  often  very  high  and  costs  are  very  high  as  a  result 
of  this  severity.  Costs  maybe  much  higher  for  patients  admitted  to  such  units  than 
for  patients  in  the  same  DRG  not  admitted  to  such  units. 

2.  Routine  Hospital  Operations.  This  care  includes  patients  referred  for 
diagnostic  and  special  treatment  services,  but  generally  on  a  nonemergency  basis. 
These  patients  are  not  more  severely  ill  than  comparable  patients  in  community 
hospitals,  but  a  large  fraction  of  them  receive  more  intense  care  for  a  variety  of 
reasons: 

—They  are  referred  for  subtle  or  highly  specialized  diagnostic  evaluation  or  for 
highly  specialized  judgment  as  to  appropriate  treatment. 

—They,  their  doctors,  and/or  their  family  want  everything  possible  done  to 
preserve  life  or  functioning.  Here  the  existence  of  a  house  staff  which  will  do 
everything  possible  at  3  in  the  morning  is  a  reason  for  referral.  By  contrast, 
many  physicians  might  prefer  to  admit  a  clearly  dying  patient  to  a  community 
hospital  if  only  supportive  care  and  relief  of  suffering  were  sought. 
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—A  significant  number  of  referred  patients  arrive  with  early  disease  but  hope 
for  curative  treatment.    These  patients  are  precisely  those  who  confound  the 
severity  model  by  receiving  care  which  is  out  of  proportion  to  the  severity  of 
their  illness. 
—The  hospital  is  organized  to  identify  the  need  for  and  to  carry  out  a  larger 
number  of  therapeutic  interventions  for  routine  and  unreferred  patients  than 
is  a  community  hospital.    Although  the  appropriateness  of  these  interventions 
tends  to  be  supportable  in  terms  of  the  medical  literature,  there  is  no  good 
literature    to   suggest   that   matched  patients  have  better   long-term   survival 
from  university  hospitals  than   from   community  hospitals  (see   Garber  et  al. 
198^  for  confirmatory  data). 
The  exact  explanation  for  the  differences  between  high  intensity  and  low  intensity 
hospitals   is   very   complex   and   involves  at   least   two    separate    issues:    technology 
diffusion  and  induced  demand. 

3.   Technology    Diffusion.      New    procedures  and   new    methods  necessarily  are 
introduced  in  a  few  facilities  and,  if  successful,  diffuse  outward  to  other  facilities. 
Most    of    the    time,    these    innovations    begin    at    large   hospitals,   usually    teaching 
hospitals,  and  move  outward  from  these  centers 
both  geographically  and  intellectually. 

'f.  Induced  Demand.  For  some  of  new  procedures,  patients  will  come  to  the 
centers  of  innovation,  but  in  many  other  cases  the  procedures  will  simply  be 
available  to  those  patients  who  come  to  the  centers  for  other  reasons.  For  these 
latter  patients,  economists  might  say  that  the  availability  of  the  procedure  induces 
demand.  But  it  is  important  to  distinguish  the  kind  of  induced  demand  in  which  the 
service  is  clearly  medically  appropriate  from  induced  demand  where  the  need  for 
the  service  is  uncertain.    There  is  considerable  debate  about  whether  large,  teaching 
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hospitals  provide  large  numbers  of  unnecessary  services  and,  if  so,  whether  this 
occurs  because  of  induced  dennand  or  because  of  inexperienced  house  staff  who 
order  unnecessary  services. 

The  idea  that  different  hospitals  may  legitimately  define  patient  needs 
differently  creates  profound  problems  for  the  severity  model  of  patient  needs.  It 
also  raises  some  question  as  to  whether  increased  costs  in  larger  hospitals  can  be 
entirely  dismissed  as  the  result  of  inefficiency,  or  whether  larger  hospitals,  with 
more  consultants  and  more  equipment,  may  share  some  characteristics  with 
teaching  hospitals. 

III.  EMPIRICAL  EVIDENCE  ON  THE  DIFFERENTIATION  OF  HOSPITALS 
One  critical  empirical  question  regarding  the  severity  model  is  whether  it  can 
account  for  differences  currently  dealt  with  by  the  urban/rural  and  teaching 
adjustments.  A  severity  measure  need  not  justify  current  payments  for  teaching  or 
current  urban/rural  differences,  but  it  should  substantially  clarify  the  contribution 
of  severity  to  the  increased  costs  of  urban  and  teaching  hospitals. 

The  difficulty  in  analyzing  differences  effectively  is  that  we  have  no  data  set 
which  directly  measures  clinical  severity  of  illness  in  different  types  of  hospitals 
even  if  we  accept  such  measures  as  APACHEII,  MEDISGRPS,  and  SOU  as  valid 
measures  of  severity.  We  have  no  measure  of  medically  appropriate  intensity  of 
care  independent  of  severity.  Thus,  all  evidence  regarding  patient  differences 
across  classes  of  hospitals  is  somewhat  indirect. 

A.  DIFFERENCES  IN  PATIENT  TREATMENT  BETWEEN  TEACHING  AND 
NONTEACHING  HOSPITALS.  Good  research  on  differences  in  treatment  between 
teaching  and  nonteaching  hospitals  is  sparse.  Garber  et  al.  (198^)  compared 
treatment  on  teaching  and  nonteaching  services  in  a  university  hospital.    They  found 


that  patients  in  the  teaching  service  had  higher  costs,  higher  survival  rates  in 
hospital,  but  equal  survival  rates  at  9-rnonth  followup  compared  to  carefully 
inatched  patients  on  the  nonteaching  service.  These  data  strongly  suggest,  btt  do 
not  prove,  that  more  intensive  therapeutic  efforts  were  made  on  the  teaching 
service  and  that  the  more  intense  effort  resulted  in  higher  cost  and  better  short- 
term  survival  but  no  change  in  long-term  survival. 

Cretin   et   al.  (1986)  report   that  the  correlation  of  profitability  of  individual 

2 
DRGs  in  nonteaching  and  teaching  hospitals  is  0.61-0.7^  (R  -Q.37-0.55).    This  means 

that  about  half  of  the  variation  in  profitability  among  DRGs  in  teaching  hospitals  is 

reflected   in   the   profitability   of   the   same    DRGs  in   nonteaching   hospitals.      This 

suggests    that    teaching    hospitals    are    not    simply    doing    more    than    nonteaching 

hospitals;     they     are     doing     different     things     in     different     patterns     which     are 

diagnosis/related. 

B.    DIFFERENCES    IN    CASE    MIX    BETWEEN    TEACHING    AND    NONTEACHING 

HOSPITALS.    Examination  of  the  CMI  and  outlier  payments  reveals  that  case  mix  as 

measured  by  PPS  is  significantly  higher  in  teaching  than  in  nonteaching  hospitals. 

But  a  further  question  is  whether  case  mix  in   teaching  hospitals  is  actually  even 

higher  but  not  fully   measured  by   PPS.     Such  an  unmeasured  case  mix  difference 

would  provide  strong  justification  for  the  higher  costs  of  teaching  hospitals,  which 

are  reflected  in  the  teaching  adjustment.    Studies  by  Coffey  and    Goldfarb  (198^), 

Coffey  (1985),  Short  and  Coffey  (198^),  and  Mitchell  et  al.  (1985b)  tell  a  story  which 

is  relatively  consistent  and  coherent. 

Coffey  and  associates  of  the  National  Center  for  Health  Services  Research  and 

Health  Care  Technology  Assessment,  have  conducted  a  number  of  studies  which  used 

Disease    Staging    to    examine    the    differences    between    teaching   and    nonteaching 

hospitals.      Disease    Staging    (Chapter    3.III)    classifies    patients    according    to    the 
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progression  of  the  disease—stage  1  is  an  early  highly  localized  condition  and  stage  ^ 
is  death.  Although  Disease  Staging  depends  on  discharge  abstract  data  and  does  not 
include  any  physiologic  measures  of  severity  of  illness,  it  should  indicate  whether 
patients  have  more  advanced  disease  and,  through  use  of  death,  should  provide  a 
measure  of  severity.  Coffey  and  colleagues  found  consistently  that  Disease  Staging 
did  not  demonstrate  a  significant  difference  in  case  mix  between  teaching  and 
nonteaching  hospitals. 

Coffey  and  Goldfarb  and  Mitchell  et  al.  (1985b)  did  find  consistently  greater  use 
of  procedures,  including  surgery,  in  teaching  hospitals  compared  to  nonteaching 
hospitals.  Their  data  did  not  permit  them  to  distinguish  patients  referred  to  the 
teaching  hospital  for  specialized  surgery  from  those  who  were  simply  routine  users 
of  the  teaching  hospital  but  were  more  likely  to  have  surgery  for  their  conditions. 
Thus,  the  greater  use  of  procedures  in  teaching  hospitals  appears  not  to  be  explained 
by  the  relatively  crude  adjustment  provided  by  staging. 

C.  TREATMENT  PATTERNS  ACROSS  REGION.  Eggers  (unpublished)  has  provided  a 
striking  analysis  of  the  different  ways  in  which  hospitals  are  used  in  different 
States.  He  compared  the  number  of  patients  dying  in  hospitals  with  the  number  of 
patients  dying  within  30  days  of  a  hospital  admission  (Table  ^.3).  (This  number  can 
be  negative  if  more  patients  die  after  a  hospital  stay  of  more  than  30  days  than  die 
outside  of  the  hospital  during  the  30  days  following  admission.)  The  differences 
across  States  are  dramatic  and  suggest  how  differently  hospitals  fit  into  the  overall 
health  care  system  in  different  States.  These  data  provide,  however,  no  guidance  on 
the  extent  to  which  differences  reflect  differences  in  quality  of  care  as  opposed  to 
differences  in  the  way  the  system  cares  for  dying  patients. 

Mitchell  et  al.  (1983b)  used  a  somewhat  different  method.  Using  Medicare  data 
from  four  States,  they  examined  clusters  of  DRGs  associated  with  cerebrovascular 
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disease,  coronary  artery  disease,  prostatic  disease,  and  pneumonia.  They  also  used 
Disease  Staging  as  a  measure  of  disease  severity  within  DRG  cluster  and  used  the 
presence  of  specific  diagnoses  to  further  refine  their  estimates.  Again,  these 
measures  accounted  for  very  little  of  the  difference  either  in  cost  or  in  procedures 
performed.  It  is  difficult  to  imagine  any  system  of  referral  or  any  catchment  area 
differentiation  which  would  account  for  the  differences  in  the  rates  of  performance 
of  certain  procedures,  such  as  CAT  scans  in  patients  with  stroke,  which  they 
observed.  The  data  strongly  suggest  that  hospitals  in  different  States  provide 
different  services  to  patients  who  are  more  similar  than  the  mix  of  services  would 
suggest. 

Nevertheless  other  data  suggest  that  differences  across  States  may  reflect 
other  factors  which  would  affect  both  assignment  of  patients  to  DRGs  and  severity 
of  illness  within  DRGs.  First,  rates  of  hospitalization  differ  sharply  across  States 
(Beebe  et  al.  1985).  Second,  diagnostic  patterns  differ  across  States  in  ways  which 
might  well  result  in  different  DRG  assignments  for  rather  similar  patients  (lezzoni 
and  Moskowitz  198^).  Third,  frequency  of  surgery  and  other  procedures  differs 
across  States  very  dramatically  (Chassin  et  al.  1986).  Mitchell  et  al.  (1985b)  also 
show  that  both  surgery  rates  per  1,000  Medicare  beneficiaries  and  surgery  rates  per 
admission  vary  dramatically  across  the  four  States  they  studied  (Table  liA).  Fourth, 
Mitchell  et  al.  find  substantial  differences  in  the  degree  of  disease  progression 
across  States  for  a  number  of  DRGs;  although  Stage  is  a  most  imperfect  measure  of 
severity,  finding  substantial  differences  in  Stage  across  States  suggests  either 
differences  in  patients  or  differences  in  diagnostic  practices.  Finally,  Mitchell  et 
al.-  show  large  DRG-specific  differences  in  death  rates  per  admission  across  States 
(Table  ^.5),  suggesting  that  hospitals  are  being  used  for  terminal  care  in  some  States 
but  not  in  others.     The  cumulative  effect  of  these  findings  is  to  cast  significant 
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doubt  on  whether  hospital  stays  have  the  same  purposes  and  whether  DRGs  have 
fully  comparable  patients  across  States. 

D.  URBAN/RURAL,  TEACHING/NONTEACHING.  AND  HOSPITAL  SIZE 
DIFFERENCES  IN  CARE.  Cromwell  et  at.  (1985)  examined  differences  in  inpatient 
C»re  controlling  for  three  dimensions  of  hospital  differences:  urban/rural, 
tcaching/nonteaching,  and  large/small.  The  picture  is  relatively  consistent  across 
the  three  classes— patients  in  large,  urban,  and  teaching  hospitals  tend  to  get  more 
procedures  of  all  kinds  than  those  in  small,  rural,  and  nonteaching  hospitals.  Their 
major  control  for  severity  of  illness  was  Disease  Staging,  whose  power  we  have 
already  reported  to  be  fairly  modest,  but  the  results  are  similar  to  and  confirmatory 
of  Goldfarb  and  associates.  Cromwell's  findings,  like  Goldfarb's,  are  striking  not 
because  Staging  failed  to  explain  all  the  difference  between  hospital  classes  but 
because  it  explained  almost  none. 

The  relationship  of  hospital  size  to  cost  of  care  varies  in  a  way  which  suggests 
that  the  key  issue  may  be  differences  between  medium-sized  and  small  hospitals 
rather  than  between  medium-sized  and  large  hospitals.  Table  U,2  shows  that,  after 
controlling  for  teaching,  case  mix,  urban/rural,  and  labor  costs,  cost  per  discharge 
was  17.6  percent  higher  in  hospitals  of  100-149  beds  than  in  hospitals  with  less  than 
50  beds.  Average  cost  per  discharge  in  hospitals  over  399  beds  was  only  6.3  percent 
higher  than  in  hospitals  of  100-1'»9  beds.  Cretin  ct  al.  (1986  Figures  '♦.I  and  ^^.2) 
found  a  comparable  pattern  in  which  profits  varied  much  more  in  the  lower  bed 
range  than  in  the  upper;  they  also  found  that  profits  actually  rose  in  the  very  largest 
hospitals,  suggesting  that  costs  fell.  Thus,  the  relation  of  cost  to  hospital  size 
mainly  reflects  differences  between  small  hospitals  and  all  other  hospitals.  These 
finding  are  at  least  consistent  with  the  hypothesis  that  the  small  hospitals  do  less 
lor    patients    and    that    patients    with    greater    needs    are    either    transferred    or 
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selectively  admitted  to  larger  hospitals.  Such  a  pattern  is  easier  to  imagine 
between  small  and  medium-sized  hospitals  than  between  medium-sized  and  large 
hospitals. 

E.  DIFFERENCE  IN  COSTS  BETWEEN  SETTINGS.  Cost  differences  are  strikingly 
illuminated  by  Cromwell  et  al.'s  examination  of  prostate  disease,  which  appears  to 
vary  little  in  either  Disease  Stage  or  services  rendered  across  types  of  hospitals. 
They  found  that  the  pattern  of  stages  for  both  benign  prostatic  hypertrophy  and 
prostatic  cancer  was  generally  similar  across  hospital  types.  The  only  differences 
were  a  somewhat  higher  staging  of  hypertrophy  in  small  hospitals  and  a  slightly 
higher  staging  of  both  hypertrophy  and  cancer  in  nonteaching  and  minor  teaching 
hospitals,  and  in  urban  hospitals  (Cromwell  et  al.  1985;  Tables  7,  13,  and  19). 

When  they  examined  the  mix  of  procedures  for  these  two  conditions  they  also 
found  a  very  homogeneous  pattern  of  procedures  and  a  very  similar  rate  of 
performing  procedures  in  urban  and  rural  hospitals  and  in  teaching  and  nonteaching 
hospitals  (Cromwell  et  al.^1985;  Tables  8  and  1^),  but  a  significant  trend  to 
performing  more  procedures  in  large  hospitals  compared  to  small  hospitals 
(Cromwell  et  al.  1985;  Table  20). 

Despite  the  homogeneity  of  Disease  Stage  and  procedures,  urban  hospitals  had 
18.9  percent  higher  costs  than  rural  hospitals  after  adjustment  for  wage  differences 
(Table  2),  major  teaching  hospitals  had  costs  70.9  percent  higher  than  nonteaching 
hospitals  (Table  9),  but  large  hospitals,  where  more  procedures  were  performed,  had 
costs  only  6.2  percent  higher  than  small  hospitals  (Table  15).  In  these  cases, 
therefore,  neither  severity  of  illness  nor  more  intensive  use  of  procedures  appears  to 
drive  the  higher  costs;  instead,  per  diem  costs  are  higher  in  the  urban  and  teaching 
hospitals,  and  unit  costs  for  procedures  may  also  be  higher. 

Clearly,  this  single  study  does  not  explain  the  differences  between  classes  of 
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hospitals.    Work  in  progress  by  lezzoni  at  Boston  University,  Ashcraft  at  University 
of  Michigan,  and  Alemi  at  Tulane  promises  to  clarify  the  picture  substantially. 

The  next  section  of  this  chapter  examines  issues  in  hospital  cost  finding  which 
may  explain  some  of  the  cost  differences  reported  by  Cromwell.  Briefly,  unit  costs 
tend  to  be  "cross-subsidized":  hospital  cost  finding  tends  to  average  costs  for  all 
patients  so  that  reported  costs  are,  therefore,  higher  than  actual  costs  for  the 
simplest  cases  in  a  hospital  and  reported  costs  are  lower  than  actual  costs  for  the 
most  complex  cases  in  a  hospital.  If  this  cross-subsidization  occurs,  it  could  make 
moderate  cost  patients  such  as  those  with  prostate  disease  appear  to  have  higher 
costs  than  they  actually  incur  in  teaching  hospitals  and  appear  to  have  lower  costs 
than  they  actually  incur  in  rural  hospitals. 

IV.  SETTING  DRG  WEIGHTS  AND  THE  PROBLEM  OF  COMPARING  HOSPITALS 
This  section  examines  two  problems  in  setting  the  relative  weights  for  DRGs.  Part 
A  describes  the  technical  problems  in  comparing  hospitals  which  result  from  a 
phenomenon  called  "weight  compression";  Part  B  examines  the  problem  of 
computing  suitable  weights  or  prices  for  making  comparisons  between  hospitals. 
A.  DRG  WEIGHT  COMPRESSION.  Pettengill  and  Vertrees  (1982),  and  more  recently 
Lave  (1985)  and  Pauly  and  Wilson  (1985),  have  examined  evidence  that  DRG  weights 
might  be  "compressed"~that  is,  that  the  payment  weights  may  be  too  high  for 
simple  DRGs  and  too  low  for  complex  DRGs.  Compression  in  the  weights  would  be 
likely  to  result  in  payments  which  were  too  high  for  hospitals  with  a  simple  case  mix 
and  too  low  for  hospitals  with  a  complex  case  mix.  Four  forces  can  produce 
compressed  weights:  differences  in  cost  patterns,  data  errors,  cross-subsidization  in 
per  diem  cost  finding,  and  cross-subsidization  in  ancillary  charges. 

1.  Differences  in  Cost  Patterns.    Payments  are  based  on  relative  costliness  of 


patients  in  different  DRGs  in  all  PPS  hospitals.  This  means  they  are  averages  of 
costs,  and  hence  of  practice  in  different  kinds  of  hospitals.  Averaging  two  or  more 
sets  of  DRG  weights  which  are  not  perfectly  consistent  will  result  in  average 
weights  which  are  more  compressed  than  any  of  the  sets  of  weights  which  were 
averaged,  a  statistical  phenomenon  termed  "regression  to  the  mean." 

DRG  weights  derived  from  different  kinds  of  hospitals  and  different  States  are 
not  perfectly  correlated.  Although  Pettengill  and  Vertrees  (1982)  found  similar 
weights  across  regions,  they  found  substantial  variation  in  patterns  of  weights  across 
hospital  types.  Lave  reports  significant  differences  between  the  cost  weights  for 
New  Jersey,  Maryland,  and  Medicare  data.  Cretin  et  al.  (1986)  found  that  patterns 
of  profitability  differ  between  teaching  and  nonteaching  hospitals,  suggesting  that 
cost  weights  calculated  for  these  two  groups  of  hospitals  would  differ  significantly. 
All  of  these  findings  indicate  that  national  weights  based  on  averages  would  be 
compressed.  The  fact  that  weights  are  computed  by  averaging  individual  cases 
rather  than  sets  of  weights  does  not  protect  against  this  source  of  compression. 

2.  Data  Errors.  Data  errors  which  result  in  misclassifying  cases  or  in 
misreporting  costs  blur  the  sharp  distinctions  between  classes  of  cases,  make  cases 
look  more  like  one  another  than  they  really  are,  and  result  in  cost  weights  which  are 
also  more  alike  than  they  should  be— i.e.,  compressed.  Data  errors  appear  to  have 
been  significant  in  the  data  used  to  establish  weights  (Worthman  and  Cretin  1986). 
Although  Lave  (1985)  argues  that  PPS  will  correct  this  effect  through  increasing 
attention  to  coding,  and  Carter  and  Ginsburg  (1986)  have  provided  evidence  that 
such  correction  is  taking  place,  the  role  of  data  error  in  current  weights  cannot  be 
established. 

3.  Cross -Subsidization  of  Per  Diem  Charges.  Lave  (1985)  points  out  that  the 
basic   Medicare  cost   finding   mechanism   produces   uniform    routine   per   diem    and 
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special  care  per-diem  costs  for  all  cases  and  that  this  inherently  produces  cross- 
subsidization  between  cases  with  intensive  per  diem  care  needs  and  those  with  less 
intensive  needs.  A  study  by  the  American  Nurses'  Association  (McKibbin  et  al.  1985) 
indicates  that  such  cross-subsidies  do  occur  and  current  work  by  Thompson  et  al. 
(198^)  supports  these  findings. 

'f.  Cross-Subsidization  of  Ancillary  Charges.  In  contrast  to  per  diem  cross- 
subsidization,  which  is  partly  the  result  of  the  Medicare  cost-finding  procedure, 
cross-subsidization  of  ancillary  charges  results  from  inaccurate  charging  by 
hospitals.  Evidence  for  such  cross-subsidies  is  much  weaker  than  evidence  regarding 
per  diems,  and  Lave  argues  cogently  that  such  cross-subsidies  are  likely  to  disappear 
under  the  pressure  of  competition,  which  will  generally  force  providers  to  price 
individual  services  more  accurately  in  order  to  remain  competitive. 

This  analysis  makes  clear  that  DRG  weight  compression  does  occur.  The 
hypothesis  can  be  tested  in  fairly  simple  fashion  by  determining  whether  hospital 
cost  per  case  changes  more  than  10  percent  for  every  10  percent  change  in  the  CMI. 
Table  ^.1  shows  that,  in  1981,  average  cost  per  discharge  changed  28.91  percent  for 
every  10  percent  change  in  the  CMI;  after  controlling  for  teaching  intensity, 
urban/rural,  and  the  wage  index,  average  cost  changed  1^.6  percent  for  every  10 
percent  change  in  the  CMI;  and  controlling  for  hospital  size  reduces  the  change  to 
10.1  percent  for  every  10  percent  change  in  the  CMI.  Thorpe  et  al.  (1986)  report 
that  in  1984  estimated  cost  per  case  rose  12.8  percent  for  every  10  percent  rise  in 
the  CMI,  after  controlling  for  teaching,  urban/rural  location,  and  wage  index,  and 
region,  but  that  the  increase  was  only  10.6  percent  for  every  10  percent  rise  in  the 
CMI  when  they  included  hospital  size  in  the  predictive  equation.  Thus,  evidence  of 
compression  is  present  unless  hospital  size  is  included  in  the  equation  predicts  cost 
per  case.   We  have  already  pointed  out  that  the  practical  importance  of  compression 
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is  that  hospitals  with  complex  case  mix  would  be  underpaid  and  hospitals  with  simple 
case  mix  would  be  overpaid.  To  examine  this  issue,  Thorpe  et  al.  "decompressed" 
DRG  weights  in  such  a  way  that  the  average  weight  did  not  change  but  weights 
above  the  average  were  increased  and  weights  below  the  average  were  decreased. 
This  approach  also  decompressed  the  hospital  CMIs.  The  effect  is  to  reduce 
evidence  of  compression  with  negligible  impact  on  simulated  program  outlays. 
Thorpe  et  al.  examined  the  effect  of  decompression  both  including  and  excluding 
hospital  size  from  the  predictive  equation.  With  size  excluded,  they  found  that 
decompression  would  reduce  the  indirect  teaching  adjustment  by  12.2  percent;  with 
size  included,  compression  was  very  small  and  decompression  had  a  negligible  effect 
(Table  4.6).  These  findings  suggest  that  the  decompression  strategy  used  in  Thorpe's 
model  is  an  oversimplification  of  the  actual  nature  of  compression.  Apparently, 
some  weights  are  more  compressed  than  others  and  the  compression  of  weights  is 
largely  concentrated  in  DRGs  which  have  higher  costs  in  teaching  than  in 
nonteaching  hospitals  and  higher  costs  in  large  hospitals  than  in  small  hospitals.  We 
would  expect  this  effect  if  compression  actually  results  from  averaging  case  mix 
patterns  which  differ  across  hospital  types. 

Thus,  paying  on  hospital  size  and  teaching  intensity  would  eliminate  the  effects 
of  compression  because  these  two  variables  can  accurately  reflect  and  correct  -for 
the  effect  of  compression  on  hospitals.  These  findings  suggest  that  decompressing 
the  DRG  weights  would  result  in  increased  payments  to  large  hospitals  and  teaching 
hospitals.  We  will  return  in  Section  VII,  to  whether  decompressing  DRG  weights 
would  be  a  useful  way  to  respect  the  effect  of  hospital  size  on  costs  without 
actually  basing  payments  on  hospital  size. 

B. -SELECTING  AN  APPROPRIATE  DATA  BASE  FOR  CREATING  WEIGHTS.  DRG 
weight  compression  might  result  in  payment  errors  across  hospital  types.     Other 
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kinds  of  weighting  and  data  errors  could  simply  reduce  the  apparent  accuracy  of  the 
payment  system  without  introducing  systematic  biases  against  a  particular  group  of 
hospitals.  In  Chapters  2  and  3  we  pointed  out  that  using  actual  costs  as  a  proxy  for 
necessary  care  risks  errors;  Section  I  of  this  chapter  pointed  to  two  areas  (hospital 
size  and  outliers)  in  which  the  designers  of  PPS  determined  that  actual  costs  would 
not  be  used  as  a  proxy  for  appropriate  costs.  This  raises  the  question  of  whether 
unnecessary  or  misestimated  costs  included  in  the  data  base  have  an  overall 
distorting  effect  on  DRG  weights,  as  the  designers  of  PPS  thought  they  might. 

1.  Inclusion  of  Inappropriate  Care  in  the  Data  Base.  Gertman  and  Restuccia 
(Gertman  and  Bucher  1971,  Gertman  and  Restuccia  1981,  Restuccia  et  al.  198^^)  and 
Systemetrics  (1983)  have  shown  that  a  substantial  amount  of  care  is  delivered  in 
acute  hospital  beds  which  does  not  require  hospitalization.  Such  inappropriate  use 
means  that  current  patterns  of  care  do  not  represent  necessary  care.  Some 
evidence  also  suggests  that  medically  appropriate  services  are  not  always  rendered, 
and  that  costs  are,  therefore,  understated  for  some  conditions.  On  the  other  hand, 
Kay  et  al.  (1986)  found  evidence  (mentioned  in  Chapter  3)  that  these  errors  may  not 
distort  either  weights  nor  estimates  of  the  explanatory  power  of  DRGs.  Kay  et  al. 
found  that  removing  days  of  care  which  did  not  require  hospitalization  (nonacute 
days)  from  the  data  base  changed  neither  the  coefficient  of  variation  for  individual 
DRGs  nor  the  average  length  of  stay  for  these  DRGs.  Since  costs  were  not  available 
and  since  the  reliability  and  validity  of  the  instrument  used  have  not  been  entirely 
proven,  this  evidence  is  quite  preliminary.  Nevertheless,  the  findings  suggest  that 
variations  in  the  appropriateness  of  care  rendered  may  not  be  a  significant  problem 
in  either  creating  weights  or  in  assessing  the  power  of  the  system. 

2.  Problems  in  Pricing  Care.  Evidence  cited  in  our  discussion  of  weight 
compression  suggested  that  determining  accurate  prices  for  DRGs  could  be  very 


difficult.  Langenbrunner  (1986)  gives  an  example  suggesting  that  this  problem  might 
become  serious  under  some  circumstances.  Current  evidence  suggests  that  the  use 
of  flat  per-diem  rates  in  general  hospitals  results  in  systematic  subsidization  of 
pediatric  services  by  other  types  of  services.  This  appears  to  occur  both  because 
pediatric  units  have  characteristically  low  occupancy  rates  with  high  rates  of 
fluctuation  and  because  pediatric  nursing  requirements  are  more  intensive  than 
nonpediatric  requirements.  The  result  is  that  Medicare  weights  for  pediatric  DRGs 
are  probably  systematically  understated,  with  a  very  slight  complementary  inflation 
of  other  weights.  This  is  not  a  problem  in  paying  general  hospitals,  where  the  errors 
appear  to  balance  out,  but  might  result  in  serious  underpayment  to  pediatric 
specialty  hospitals,  where  there  are  no  nonpediatric  beds  to  subsidize  the  pediatric 
services.  A  similar  phenomenon  may  occur  in  pricing  psychiatric  services,  where 
the  per  diem  costs  of  acutely  ill  patients  are  averaged  with  those  of  the  chronically 
ill  or  custodial  patients  whose  care  is  not  part  of  the  Medicare  benefit  package.  In 
contrast  to  issues  of  weight  compression  and  "dirty"  calibration  data,  which  appear 
to  have  limited  significance,  distortion  of  weights  by  cross-subsidization  may  have  a 
significant  impact  on  the  equity  of  payment  under  PPS. 

V.  THE  "ACCURACY"  OF  PAYMENTS  IN  THE  FIRST  YEAR  OF  PPS 
An  important  source  of  evidence  on  hospital-level  factors  affecting  case  mix  is 
the  actual  relationship  of  PPS  payments  to  costs  as  well  as  the  resulting  profits  and 
losses.  Gains  or  losses  by  classes  of  hospitals  suggest  either  that  these  classes  of 
hospitals  have  systematic  management  strengths  or  weaknesses  or  that  factors  for 
which  PPS  does  not  fully  account  have  an  important  impact  on  the  system's  equity. 
Thus,  gains  and  losses  across  hospitals  that  are  not  consistent  with  the  objectives  of 
PPS  would  invite  a  reexamination  of  whether  PPS  accurately  reflects  the  real  world 
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in  which  hospitals  operate. 

The  accuracy  of  PPS  is  usually  measured  by  how  closely  payments  match 
current  costs,  but  this  criterion  must  be  very  carefully  applied.  The  possibility  of 
greater  efficiency  is  one  of  the  foundations  of  PPS,  so  a  perfect  match  between 
costs  and  payments  is  not  expected  and  would  probably  not  be  desirable.  On  the 
other  hand,  much  of  PPS  is  an  effort  to  model  current  costs  fairly  accurately,  so 
substantial  correspondence  between  costs  and  payments  is  expected  and  desirable. 
However,  the  analysis  already  presented  makes  clear  that  we  cannot  determine 
technically  the  exact  degree  of  correspondence  which  PPS  should  attain. 

This  section  analyzes  the  relation  between  costs  and  payments  in  four  ways: 
the  amount  of  variation  in  costs  which  is  reflected  in  payments,  the  operating 
margins  which  various  groups  of  hospitals  experienced,  the  number  of  hospitals  in 
different  groups  experiencing  various  operating  margin  levels,  and  the  relative 
association  of  different  hospital  characteristics  with  level  of  operating  margin. 
A.  METHODS.  Because  this  chafpter  focuses  on  hospital-level  issues,  all 
comparisons  will  be  made  for  hospital  average  costs  and  payments,  not  for 
costs  and  payments  at  the  individual  patient  level.  Operating  margin  at  the  hospital 
level  means  the  difference  between  costs  for  Medicare  patients  and  payments  made 
by  Medicare,  expressed  as  a  percent  of  Medicare  costs.  Operating  costs  and  margins 
exclude  both  capital  payments  and  payments  for  the  direct  expenses  of  medical 
education,  which  are  "passed  through"  outside  the  PPS  per-case  payment 
mechanism. 

All  analyses  in  this  section  are  based  on  actual  Medicare  cost  reports  for  the 
first  year  in  which  hospitals  were  paid  under  PPS.  These  cost  reports  were  used  in 
making  projections  of  the  impact  of  the  PPS  payment  regulations  for  Federal  fiscal 
year  1987.   The  file  contains  usable  cost  and  payment  information  for  ^5^^^^  hospitals 
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and  cost  information  alone  for  5350  hospitals;  only  a  small  fraction  of  these  reports 
had  been  finally  audited  and  settled  when  these  calculations  were  made. 

In  order  to  project  actual  payments  under  a  fully  implemented  PPS  system,  we 
took  the  national  component  of  1987  payments,  which  will  actually  comprise  75 
percent  of  payments  in  hospital  reporting  periods  beginning  in  1987,  and  assumed  a 
full  implementation  in  which  this  national  component  was  100  percent  of  payments 
(except  for  sole  community  hospitals).  We  further  adjusted  to  reflect  the  shift  in 
calculating  urban  and  rural  rates  which  will  become  effective  in  1988.  We  then 
applied  that  payment  rate  to  discharges  during  the  hospital's  first  PPS  year.  For 
estimates  of  costs  at  full  implementation  we  inflated  costs  from  1984  at  the 
inflation  rate  of  HCFA's  hospital  market  basket.  The  reader  should  notice  that 
though  we  have  projected  the  situation  under  full  implementation  we  have  no 
knowledge  of  what  changes  may  occur  in  regulations  between  1986  and  1988,  of  the 
actual  rate  of  inflation,  or  of  changes  in  hospital  productivity  between  1984  and 
1988.  We  have  applied  the  1987  "disproportionate  share"  adjustment  and  the 
adjustments  required  by  the  Omnibus  Budget  Reconciliation  Act  of  1986  (OBRA-86 
PL  99-509). 

We   used   estimates   developed    by   Guterman   (1986)  of   the   degree   to   which 

different  classes  of  hospitals  were  able  to  reduce  their  operating  costs  in  response 

to  PPS.    His  estimates  reflect  the  difference  between  the  cost  per  case  which  HCFA 

projected  for  each  hospital  in  establishing  hospital-specific  payment  rates  for  the 

first  year  of  PPS  and  the  actual  costs  reported  by  hospitals  for  that  year.    Because 

HCFA  projected  1984  costs  from  unaudited  1982  cost  reports  and  overestimated 

inflation,  the  resulting  cost  projections  are  somewhat  high  (PROPAC  1985).    This 

means  that  our  estimates  of  operating  cost  reductions  are  also  somewhat  high;  we 

discuss  these  issues  further  in  Section  E.l.b. 

B.     OVERALL  LEVELS  OF  MARGINS.    We  present  overall  margin  levels  in  several 

ways  so  that  readers  will  appreciate  the  range  of  estimates  which  can  be  made  from 
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a  single  set  of  data  and  so  that  they  will  not  give  excessive  weight  to  differences  of 
a  few  percentage  points  in  rnargin  levels  reported  by  different  sources.  We  can 
calculate: 

—  the  operating  margin  for  the  entire  hospital  industry.  To  do  this  we  compare 
all  PPS  operating  payments  with  all  Medicare  operating  costs;  by  this 
method,  overall  margin  in  198^  was  16.^  percent  and  at  full  implementation 
would  be  13.3  percent 

—  the  case-weighted  average  hospital  margin.  To  do  this  we  compute  a  margin 
for  each  hospital  and  then  calculate  an  average  in  which  each  hospital's 
margin  is  weighted  by  the  hospital's  number  of  Medicare  discharges.  This 
method  gives  15.8  percent  in  198^  and  16.0  percent  with  full  implementation. 

—  a  simple  average  of  hospital  margins.  To  do  this  we  compute  a  margin  for 
each  hospital  and  calculate  an  average  in  which  each  hospital  has  equal 
weight  regardless  of  size.  This  gives  11.6  percent  in  198^  and  17.7  percent 
with  full  implementation. 

Each  of  these  figures  has  advantages  and  disadvantages.  The  industry  figure 
reflects  the  relation  of  what  Medicare  is  paying  to  the  cost  of  producing  the  care  it 
pays  for;  it  is  least  /ulnerable  to  auditing  and  data  errors.  The  case-weighted 
average  gives  greater  weight  to  the  experience  of  a  large  hospital  than  to  that  of  a 
small  hospital,  while  the  hospital-weighted  average  gives  equal  weight  to  each 
hospital;  both  case-weighted  and  hospital-weighted  averages  are  quite  sensitive  to 
data  errors  and  to  how  extreme  data  values  are  trimmed.  For  example,  both 
averages  show  higher  margins  at  full  implementation  than  in  198^  while  the 
hospital-level  figure  shows  the  reverse  pattern.  This  occurs  because  very  high 
profits  have  been  trimmed  from  the  198^  data  but  not  from  the  full  implementation 
projections.      3y    any    computation,   however,    margins    are    higher    than    many   had 
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expected  at  the  beginning  of  PPS. 

1.  Reasons  for  High  Margins.    Several  forces  appear  to  have  created  the  high  overall 

margins: 

a.  Overestimated  Base  Year  Costs.  In  order  to  use  the  most  recent  data,  HCFA 
employed  unaudited  cost  reports  to  set  payment  rates;  such  reports 
characteristically  overstate  costs  or  include  costs  which  HCFA  ultimately  does  not 
allow.  A  General  Accounting  Office  (GAO  1985)  analysis  estimated  that  payment 
rates  were  ^.3  percent  higher  than  they  would  have  been  if  based  on  audited  reports. 

b.  Coding  Changes  which  Increased  Payments.  Upgraded  coding  resulted  in 
higher  average  case  mix  index  (CMI)  than  had  been  anticipated.  Carter  and 
Ginsburg  (1986)  conclude  that  the  higher  CMI  was  largely  attributable  to  secular 
trends  and  the  more  accurate  case  mix  measurement  made  possible  by  the  new 
Medicare  billing  form.  They  also  found,  however,  a  modest  but  sharp  increase  in 
case  mix  at  the  time  hospitals  went  on  PPS.  They  provide  evidence  that  this 
increase  was  a  one-time  phenomenon;  other  studies  of  coding  error  suggest  that  the 
observed  change  was  smaller  than  would  have  been  expected  if  previous  coding 
errors  had  been  entirely  corrected.  Carter  and  Ginsburg's  analysis  suggests  that  this 
change  in  coding  resulted  in  ^.9  percent  higher  margins  than  would  have  otherwise 
occurred. 

c.  Overestimated  Inflation.  In  setting  the  base  rates,  HCFA  estimated  a  higher 
inflation  rate  for  the  hospital  market  basket  than  actually  occurred. 

d.  Increased  Efficiency.  Finally,  hospitals  appear  to  have  become  more 
efficient.  They  have  decreased  length  of  stay  for  Medicare  patients,  decreased 
staffing,  and  taken  other  steps  to  provide  care  at  lower  costs.  They  have  also 
transferred  Medicare  costs  to  skilled  nursing  facilities  and  ambulatory  settings, 
producing  major  decreases  in  hospital  costs  but  less  certain  net  savings  to  Medicare. 
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These  changes  arc  docurnented  in  the  Annual  Impact  Reports  for  198^  and  1985. 
2»  HCFA  and  A-nerican  Hospital  Association  Estimates.  Our  estimates  lor  Medicare 
margins  differ  from  those  of  the  American  Hospital  Association  (AHA),  which 
estimated  total  rtet  margins  of  6.2  percent  in  198^*  and  5.9  percent  in  1985  and  net 
patient  margins  of  2.0  percent  in  198^  and  i.5  percent  in  1985  (Hospital  Research 
•nd  Educational  Trust  1985).  This  difference  between  HCFA  and  AHA  estimates 
reflects  important  differences  in  definition  and  in  the  services  described: 

a.  Definitions  of  Margin. 

—  AHA's  "net  patient  margin"  includes  costs  from  activities  other  than  patient 
care  but  excludes  revenues  from  these  sources.  AHA's  "total  net  margin" 
therefore  corresponds  better  to  usual  notions  of  profit  and  loss  and  is  a  better 
basis  for  comparison  with  HCFA  figures. 

—  Medicare  calculations  of  margins  are  limited  to  inpatient  services  to 
Medicare  beneficiaries.  By  contrast,  AHA  figures  include  all  hospital 
activities  (inpatient,  outpatient,  and  nonpatient  care)  and  all  groups  of 
patients.  Although  ancillary  nonpatient  care  may  be  profitable,  outpatient 
business  is  traditionally  unprofitable. 

—  Medicare  calculations  exclude  capital  and  direct  medical  education  from  both 
costs  and  payments  while  AHA  includes  them.  Because  Medicare  pays  both 
of  these  items  at  cost,  including  them  in  both  costs  and  revenues  necessarily 
brings  the  calculated  margin  nearer  to  zero. 

—  AHA  calculates  margins  as  a  percentage  of  revenues  while  HCFA  calculates 
margins  as  a  percentage  of  costs.  When  revenues  exceed  costs,  as  they  did  by 
both  AHA  and  HCFA  calculations,  the  AHA  method  of  calculation  always 
yields  a  lower  margin. 

b.  Differences  in  Margins  for  Medicare  and  Non-Medicare  Patients.   For  several 
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reasons,  profits  on  Medicare  business  were  higher  than  profits  on  non-Medicare 
business: 

—  Medicare  paid  more  than  intended  in  the  first  year  of  PPS,  which  was 
probably  not  generally  true  of  other  payers 

—  Medicare  incentives  were  intended  to  let  hospitals  profit  by  reducing  length 
of  stay,  eljrninating  unnecessary  services,  and  providing  services  more 
efficiently;  other  payers  sought  to  control  costs  primarily  by  reducing  the 
number  of  services  rendered  while  continuing  to  pay  on  the  basis  of  services 
actually  rendered. 

—  Hospital  revenue  on  non-Medicare  business  is  reduced  by  bad  debt  and 
discounts  to  insurers.    Medicare  pays  for  bad  debt  on  Medicare  cases 

and  does  not  demand  discounts. 
C.   HOU     WELL    DO    PPS    PAYMENTS    REFLECT    DIFFERENCES   IN   OPERATING 
COSTS'!*    In  the  first  year  of  PPS,  payments  reflected  89.0  percent  of  the  variation 
in  costs.     When  PPS  is  fully  implemented,  we  estimate  that  payments  will  reflect 

yi.**    percent   of   variation   in    198'*    hospital    costs    (the    percent   of    the    variation 

2 

explained   is  R   ,  which  is  the  square  of  the  simple  correlation  between  costs  and 

revenues  at  the  hospital  level).  We  cannot  predict  how  well  payments  will  reflect 
the  hospital  costs  which  cyist  at  the  time  of  full  implementation  because  we  cannot 
predict  what  changes  hospitals  may  make  in  their  efficiency  between  198'*  and  19SS 
in  response  to  the  incentives  of  PPS  or  other  pressures. 

PPS  payment  variables  (urban/rural,  teaching,  wage  index,  and  case  mix  index) 
account  for  no  more  of  the  variation  in  hospital  costs  in  19S^  than  then  they  did  in 
1981  (69.2  percent  in  1981  v.  68.2  percent  in  198«<;  Table  4.1).  The  decrease  is  small 
but  does  not  suggest  that  hospitals  are  becoming  more  alike  through  responding  to 
the  PPS  incentives. 
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These  cost  regressions  in  Tables  ^.1  and  l^■.2  show  the  limit  of  how  well  the  PPS 
payment  variables  can  be  used  to  make  payments  match  costs.  The  two  additional 
variables  not  included  in  the  original  calculations  are  disproportionate  share 
payments  and  outlier  payments.    Table  ^.2  shows  that,  when  they  are  included,  the 

equation  explains  73.7  percent  of  cost;  this  roughly  matches  the  71.^  percent  of  cost 

2 
variation  which  is  reflected  in  payments.    The  small  differences  between  the  R     m 

the  regression  equation  and  the  actual  relation  of  costs  to  payment  reflects  two 

kinds  of  differences  between  the  equations  and  actual  payment  policy: 

—  the  equations  do  not  fully  take  account  of  the  unique  position  of 
sole  community  hospitals  and  rural  referral  centers,  and 

—  the  payment  system  is  not  constructed  from  the  cost  equation  and,  in 
particular;  the  equation  implies  outlier  payments  about  2.5  times  larger  than 
HCFA  pays  and  disproportionate  share  payments  about  an  eighth  of  those 
defined  by  law.   We  return  to  the  outlier  question  in  the  next  section. 

The  equity  of  PPS  depends  on  the  significance  of  that  cost  variation  which  will 
not  be  reflected  in  payments  when  the  system  is  fully  implemented — an  estimated 
28.6  percent.  If  the  variation  which  is  not  reflected  in  payments  results  from 
factors  such  as  inefficiency,  for  which  Medicare  should  not  pay,  then  PPS  is 
equitable;  if  this  variation  results  from  factors  such  as  variation  in  severity  of 
illness  within  DRGs,  for  which  Medicare  should  pay,  then  there  is  a  problem. 
Unfortunately,  we  cannot  currently  determine  which  of  these  two  factors 
predominates,  so  technical  procedures  cannot  fully  determine  the  equity  of  PPS. 
Horn  et  al.(1985a,  1983b)  have  argued  that  interhospital  differences  in  severity 
account  for  a  large  part  of  the  difference  between  hospital  costs  and  PPS  payments, 
but  their  sample  was  small  and  their  methods  are  controversial  (Davis  1985,  Mullin 
1985,  Horn  1985c). 
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Thus  a  judgment  on  the  equity  of  PPS  payments  involves  an  ultimately 
nontechnical  balancing  of  the  apparent  equity  of  using  uniform  national  payment 
rates  against  uncertainty  whether  uniform  payments  are  truly  equitable. 

As  a  result  of  this  technical  uncertainty,  many  critics  judge  the  equity  of  PPS 
by  how  many  hospitals  are  hurt,  which  ultimately  depends  on  the  absolute  level  of 
Medicare  payments.    From  this  perspective,  a  more  detailed  picture  of  how  many 
hospitals  win  and  lose  is  illuminating. 
D.  WINNING  AND  LOSING  HOSPITALS. 

1.  Magnitudes  and  Distributions  of  Margins.  Table  ^.7  shows  average  margin  by 
class  of  hospital.  These  margins  are  computed  using  aggregated  payments  and  costs 
for  the  entire  group  of  hospitals.  Although  overall  margins  appear  strong,  Tables  ^.8 
and  k.9  show  that  there  are  significant  numbers  of  hospitals  with  losses,  and  even 
substantial  losses,  in  each  group.  Overall,  19.5  percent  of  hospitals  had  losses  in  the 
first  year  of  PPS  and  7 A  percent  had  losses  greater  than  10  percent  of  costs.  At 
full  implementation,  23.3  percent  would  have  losses  and  10.6  percent  would  lose 
more  than  10  percent.  The  projected  losses  under  full  implementation  are,  of 
course,  very  uncertain  because  potential  losers  are  very  likely  to  change  their 
practice  patterns  to  avoid  such  losses. 

The  range  of  margins  grows  under  full  implementation — that  is,  there  are  more 

big  losers  and  even  more  big  winners:  the  number  losing  more  than  20  percent  goes 

from  3.0  percent  to  3.7  percent  (from  160  hospitals  to  197)  and  the  number  profiting 

more  than  30  percent  goes  from  9.5  percent  to  26.0  percent  (from  432  hospitals  to 

1388).    This  is  expected:  PPS  was  designed  with  a  set  of  cushions  to  protect  hospitals 

during  transition   to  national   rates,  and  those  cushions  will  be  phased   out   when 

payments  are  fully  national.    Nevertheless,  these  findings  underline  the  questions  of 

what  the  big  winners  will  do  with  their  margins  and  whether  the  big  losers  will 

reduce  access  to  adequate  patient  care. 
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2.  Implications  of  Operating  Margins.  The  impact  of  a  particular  operating 
margin  depends  both  on  the  relative  margins  of  other  hospitals  and  on  the  overall 
changes  in  margin  which  may  result  from  future  changes  in  Medicare  payment 
policy.  Relative  margins  are  important  because  hospitals  with  relatively  low 
margins  may  have  difficulty  securing  capital  and  other  needed  forms  of  support. 
Absolute  margins  are  important  not  only  for  their  immediate  dollar  value  but  also  as 
a  cushion  if  payment  policy  changes.  For  example,  if  Medicare  payments  were 
adjusted  so  that  overall  operating  margins  dropped  from  13.3  percent  to  10  percent 
at  full  implementation,  then  hospitals  which  now  have  margins  less  than  3.0  percent 
would  show  a  loss  on  Medicare  business.  Thus,  tightening  up  on  the  PPS  annual 
update  factor  would  not  only  reduce  average  margins  but  also  produce  more  losers 
unless  hospitals  which  now  have  borderline  margins  could  achieve  additional 
economies.  Under  national  payment  rates,  we  project  that  trimming  overall 
operating  margin  from  13.3  percent  to  10.0  percent  would  increase  the  percentage 
of  hospitals  with  operating  losses  on  Medicare  business  from  23.3  percent  to  32.2 
percent  of  all  hospitals. 

E.  PATTERNS  OF  MARGINS.  Tables  4.7-^.9  show  that  large,  urban,  and  teaching 
hospitals  did  better  in  the  first  year  of  PPS  than  small,  rural  and  nonteaching 
hospitals.  Table  ^.10  shows  a  regression  analysis  of  the  factors  which  are  associated 
with  hospital  margins.  This  table  summarizes  the  effect  of  each  hospital 
characteristic  on  operating  margin  when  all  other  characteristics  are  held  constant. 
Thus,  for  example,  the  difference  in  inargin  between  urban  and  rural  areas  is 
separated  from  the  effect  of  different  hospital  sizes  in  urban  and  rural  areas.  The 
first  point  to  notice  about  this  analysis  is  that  the  associations,  although 
statistically  very  significant,  are  relatively  weak.  The  predictors  account  for  no 
more   than    16  percent  of    total   variation    in    margins   either   in    198^   or    at    full 


implementation.  In  addition: 

1.  Rural.  An  urban  hospital  would  have  10  percent  higher  profits  than  a  rural 
hospital  under  national  rates  if  it  had  otherwise  similar  characteristics;  but  rural 
hospitals  have  lower  wage  rates,  which  is  associated  with  higher  profits.  The  net 
effect,  as  tables  'f.S  and  ^.9  show,  is  that  urban  and  rural  hospitals  will  differ  little 
in  profitability  after  adjusting  for  differences  in  teaching  status  and  size. 

2.  Outliers.  Including  outliers  in  the  variables  used  to  predict  profits  shows 
that  having  outlier  cases  is  associated  with  losses  in  198^  and  is  much  more  closely 
associated  with  losses  at  full  implementation.  Because  we  cannot  tell  whether 
having  outlier  cases  indicates  difficult  case  mix  or  inefficiency,  we  rely  on  the 
analysis  which  omits  outliers  in  the  following  discussion. 

3.  Efficiency  Changes  in  Small  and  Rural  Hospitals.  Table  ^.11  shows  that  both 
urban  and  rural  hospitals  held  their  costs  below  the  levels  which  HCFA  estimated  at 
the  beginning  of  PPS.  As  we  have  mentioned,  these  "savings"  partly  reflect  HCFA's 
overestimating  costs  and  partly  reflect  coding  changes,  but  the  difference  between 
HCFA  estimates  and  reported  actual  costs  is  far  larger  than  any  error  which  has 
been  suggested  and  a  substantial  part  probably  represents  real  savings.  Rural 
hospitals  achieved  an  average  of  9.5  percent  savings  in  costs  during  the  first  PPS 
year  compared  to  18.0  percent  savings  for  the  average  urban  hospital.  Although  this 
difference  in  savings  may  partly  reflect  the  smaller  size  of  rural  hospitals,  it 
probably  also  reflects  the  fact  that  rural  hospitals  were  already  delivering  a  leaner 
service  mix  (Cromwell  et  al.  1985).  Reducing  services  may  be  much  more  difficult 
for  rural  hospitals.  As  urban  hospital  costs  come  closer  to  those  of  rural  hospitals, 
deciding  whether  they  should  retain  the  resulting  profits  will  pose  tricky  problems  of 
equity.  A  forthcoming  report  to  Congress  will  address  these  urban-rural  issues  in 
detail. 


3.  Disproportionate  Share  Hospitals.  Disproportionate  share  hospitals,  contrary 
to  expectations,  had  slightly  higher  margins  in  198^  than  otherwise  similar  hospitals 
without  a  disproportionate  share  of  poor  patients.  Table  ^.0  shows  that  when  these 
hospitals  receive  a  premium  payment,  their  margins  will  rise  further  above  average. 
Disproportionate  share  hospitals  tend  to  be  those  (large,  urban,  and  teaching 
hospitals)  which  realized  greatest  efficiencies  in  198^.  The  margins  may  reflect 
these  savings,  since  Table  ^.2  shows  that  such  hospitals  do  indeed  have  slightly 
higher  costs  than  otherwise  similar  hospitals. 

^.  Hospital  Size.  Table  ^.10  shows  that  hospitals  with  fewer  than  50  beds  had 
slightly  lower  margins  than  larger  hospitals  in  198'f  but  that  these  hospitals  will  have 
7-1^  percent  higher  margins  under  national  rates.  This  increased  margin  under 
national  rates  reflects  the  fact  that  the  historical  costs  of  small  hospitals  were 
about  10-25  percent  below  those  of  larger  hospitals  (Tables  ^.1-^.2).  In  198^,  the 
difference  in  historical  costs  gave  small  hospitals  only  a  slight  advantage  because 
payments  were  75  percent  based  on  the  historical  costs  of  the  individual  hospital. 
The  historically  lower  costs  of  small  hospitals  will  translate  into  a  substantial 
windfall  when  payments  are  based  on  national  averages  under  fully  national  rates. 
The  higher  margins  of  small  hospitals  reflects  lower  historical  costs  rather  than 
increased  efficiency  since  the  inception  of  PPS.  Table  4.11  shows  that  they  have 
achieved  less  savings  than  large  hospitals  under  PPS,  Section  VII  examines  the 
problem  of  hospital  size  further. 

5.  Margins  in  High  CMI  Hospitals.  In  1984  margins  were  higher  at  hospitals  with 
high  Case  Mix  Indices  (CMI).  This  relationship  disappears  in  the  projection  of 
margins  at  full  implementation.  We  cannot  prove  why  high  CMI  hospitals  had  high 
margins,  but  two  explanations  are  plausible: 

—  Hospitals  with  a  high   CMI  have  more  ability  to  achieve  economies.     This 
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interpretation  is  consistent  with  the  economies  achieved  in  large,  urban,  and 

teaching  hospitals,  all  of  which  have  higher  CMIs. 

—  Cretin  et  al.  (1985)  suggest  that  CMI  "creep"  has  occurred  selectively.    If 

creep  was  greater  in  hospitals  with  CMIs  below  the  CMIs  of  otherwise  similar 

hospitals  but  was  distributed  randomly  across  hospital  types,  then  hospitals 

with  creep  would  tend  to  do  better  and  CMI  would  be  positively  related  to 

profit  under  198^  rates  but  not  under  national  rates.    Under  198^*  rates,  the 

improvement  over   the   historical   CMI   would   produce    profit;    but   a    move 

toward  typical  CMI  for  that  group  of  hospitals  would  not  produce  greater 

profit  under  national  rates.    This  speculative  line  of  reasoning  is  consistent 

with    the    hypothesis    that    creep    consists    more    in    correcting    erroneous 

undercoding  than  in  inflating  appropriate  coding. 

At  first  glance,  these  findings  do  not  appear  to  support  the  hypothesis  that  DRG 

weights  are  "compressed",  since  that  hypothesis  appeared  to  predict  losses  in  high- 

CMI  hospitals.     But  we  have  already  shown  that  the  compression  of  DRG  weights 

manifests   itself   in   overpayments   to   small   hospitals   and   underpayment   to   large 

hospitals,    so    we    expected    to    find    higher    margins    in    small    hospitals    at    full 

implementation  and  we  would  not  expect  large  losses  in  high-CMI  hospitals  once  we 

have  controlled  for  differences  in  hospital  size. 

6.  Teaching  Hospitals.  Table  (i.9  shows  and  Table  ^.10  confirms  that  teaching 
hospitals  have  are  substantially  higher  margins  than  others  and  that  this  trend  will 
persist  under  projected  fully  national  rates.  The  greater  impact  of  teaching  on 
margins  at  national  rates  is  somewhat  unexpected  because  legislation  has  lowered 
the  teaching  adjustment  from  11.6  to  8.7  percent.  The  explanation  is  that  even  the 
lower  teaching  adjustment  is  profitable  and  that  teaching  hospitals  are  more 
profitable  when  paid  at  national   rates  than   when   paid  at  their  historical  costs. 

^-37 


Table  4-5.6  shows  that  teaching  hospitals  have  achieved  very  large  economies 
relative  to  their  historical  costs;  because  the  teaching  adjustment  is  based  on  their 
historical  costs,  it  provides  increases  in  margins. 

7.  State  and  Regional  Differences.  Table  4.12  shows  that  even  after  controlling 
for  differences  in  hospital  characteristics  there  were  some  differences  in  operating 
margins  across  states  and  regions  in  the  first  year  of  PPS  and  that  these  differences 
will  be  substantial  under  national  rates.  For  example,  margins  in  Mississippi, 
Kentucky,  Tennessee,  Alabama,  Nebraska,  North  Dakota,  South  Dakota,  Utah,  Idaho. 
Montana,  Colorado,  Wyoming,  Oregon,  and  Washington  will  be  7-14  percentage 
points  higher  than  margins  in  the  New  England  States,  California,  Florida,  Illinois, 
and  Michigan.  These  differences  in  operating  margins  primarily  reflect  differences 
in  costs  which  existed  before  PPS  and  to  a  far  smaller  extent  reflect  differences  in 
efficiencies  realized  under  PPS. 

E.  SUMMARY.  We  have  already  pointed  out  that  we  do  not  have  the  technical  tools 
which  would  be  necessary  to  determine  whether  any  of  these  patterns  of  profit  and 
loss  are  equitable.  At  the  same  time,  some  are  unexpected  and  each  must  be 
carefully  examined  to  determine  whether  it  conforms  to  the  purposes  of  PPS.  The 
fact  that  almost  a  third  of  variation  in  cost  is  not  reflected  in  variation  in  payment 
demands  both  further  research  and  continued  careful  monitoring  of  the  impact  of 
the  system. 


VI.  OUTLIER  PAYMENTS  AND  EQUITY  TO  HOSPITALS 
As  we  indicated  at  the  beginning  of  this  chapter,  the  outlier  payment  system 
was  shaped  by  three  forces:  a  need  to  protect  hospitals  from  random  risk,  a  need  to 
protect  certain  hospitals  with  large  numbers  of  outliers  from  the  fact  that  the  DRG 
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system  could  not  classify  such  cases,  and  a  need  to  protect  the  integrity  of  the 
incentives  for  efficiency  which  were  created  by  prospective  payment  by  the  case. 
A.  OUTLIERS  AND  MARGINAL  COSTS.  PPS  legislation  defines  marginal  cost  as 
the  basis  for  outlier  payment,  but  the  concept  of  marginal  costs  is  often  puzzling 
and  elusive.  The  economic  definition  would  be  that  the  marginal  cost  of  the  outlier 
case  would  be  the  cost  of  the  case  if  all  the  fixed  costs  of  the  hospital  (such  as 
buildings,  equipment,  and  perhaps  some  administrative  staff)  are  excluded  and  the 
only  costs  counted  are  those  specific  to  the  patient  (such  as  most  staff,  dietary, 
pharmaceuticals,  and  housekeeping).  In  some  industries,  such  as  electric  utilities, 
the  difference  between  fixed  and  marginal  costs  is  fairly  obvious  and  fairly  easy  to 
define,  but  this  is  not  true  in  hospitals.  When  the  original  PPS  regulations  were 
written,  evidence  was  available  to  argue  that  marginal  cost  represented  anywhere 
from  20  to  90  percent  of  average  costs  for  all  patients.  More  recent  research 
(Pauley  1985  and  unpublished  work  by  Paul  Ginsburg  of  the  Rand  Corporation) 
suggests  that  marginal  costs  is  very  close  to  average  cost  in  the  hospital  industry. 
This  finding  is  unexpected  in  an  industry  where  the  cost  of  equipment  and  facilities 
receives  so  much  publicity,  and  it  probably  reflects  three  factors: 

1.  Actual  Hospital  Management  Practice  Has  Changed.  The  60  percent 
estimate  was  based  on  the  best  available  research,  but  that  was  research  that  was 
generally  10  to  25  years  old.  Staffing  levels  are  now  much  more  closely  matched  to 
changing  workload;  empty  beds  and  wards  are  not  left  staffed.  Because  more  than 
7Q  percent  of  hospital  cost  is  labor-related,  this  change  in  practice  has  great 
impact. 

2.  Additional  Work  Often  Cannot  be  Scheduled  in  Hospitals.  Because  additional 
work  tends  to  be  the  result  of  medical  necessity,  it  cannot  be  scheduled  and  often 
has  the  disruptive  effect  of  reducing  efficiency  instead  of  using  idle  capacity. 
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3.  "Fixed"  Costs  Vary  With  Volume.  Economists  usually  estimate  marginal 
costs  by  examining  the  costs  across  hospitals  as  either  number  of  admissions  or 
length  of  stay  increases.  But  as  either  of  these  increases,  the  sophistication  of 
hospital  facilities,  cost  of  capital  equipment,  complexity  of  hospital  organization, 
and  expense  of  overhead  also  tend  to  increase.  Thus,  the  traditional  economic 
concept  of  marginal  cost  becomes  difficult  to  apply. 

Thus,  research  evidence  suggests  that  a  new  estimate  of  marginal  cost  may  be 
needed.  Empirical  evidence  from  the  first  year  of  PPS  bears  on  this  issue  as  well  as 
the  broader  issue  of  the  impact  of  outlier  policy. 

Guterman  (1986)  has  analyzed  the  distribution  of  outlier  cases  and  payments 
during  the  first  year  of  PPS.  He  had  information  as  to  hospital  charges  for  each 
case,  Medicare  payment  (except  for  direct  medical  education  and  capital  payments), 
and  hospital  and  patient  characteristics.  He  estimated  operating  costs  by  applying 
hospital-level  cost/charge  ratios  which  had  been  generated  by  Cretin  et  al.  (1986)  in 
the  Rand  synthetic  cost  study.  Omitting  capital  may  make  the  concentration  of 
outlier  payments  in  large,  urban,  and  teaching  hospitals  appear  slightly  larger  than  it 
really  is.  Because  the  file  was  complete  through  April  1985,  very  few  cases  are 
omitted.  Guterman  identified  the  difference  between  charges  and  payments  as  a 
critical  variable  and  examined  its  distribution  across  hospitals  and  beneficiaries. 
B.  FINDINGS.   Guterman  found: 

1.  Outliers  account  for  the  majority  of  the  difference  between  charges  and 
payments  under  PPS.  The  2.96  percent  of  cases  which  were  classified  as  outliers 
accounted  for  56.02  percent  of  the  difference  between  hospital  charges  and 
Medicare  payments  (Table  ^.13). 

2.  Outliers  have  very  high  charges  and  very  high  losses.  Outlier  cases  are  very 
expensive:  2.96  percent  of  cases  account  for   17.66  percent  of   Medicare  covered 


charges,  17.19  percent  of  estimated  operating  costs,  but  only  6.35  percent  of 
payments.  Average  estimated  costs  for  outliers  were  $16,819,  which  is  6.80  times 
the  average  nonoutlier  cost  of  $2,^73.  The  average  payment  for  an  outlier  case  was 
$7,2^^8,  resulting  in  an  estimated  loss  of  $9,573;  this  loss  contrasts  with  an  estimated 
profit  of  $791  on  nonoutlier  cases  (Table  ^.1^).  This  estimate  is  slightly  overstated 
because  the  payments  estimate  omits  capital  and  direct  medical  education 
payments.  Nevertheless,  having  a  large  percentage  of  outliers  was,  all  other  things 
being  equal,  associated  with  significantly  lower  profits  in  the  first  year  of  PPS 
(Table  ^^.8). 

3.  Outliers  appear  to  have  more  severe  illness.  Outlier  cases  appear  to  be 
sicker  than  nonoutliers,  even  in  proportion  to  their  longer  stays.  Routine  daily  room 
charges  are  38.0  percent  of  the  bill  for  the  typical  patient  and  21.3  percent  for 
outliers;  intensive  care  charges  are  6.36  percent  of  typical  bills  and  11.8  percent  of 
outlier  charges;  ancillary  charges  are  55.1  percent  of  charges  for  nonoutliers  and 
66.9  percent  for  outliers  (Table  k.\k), 

^.  PPS  day  and  cost  outliers  have  similar  charges  but  PPS  pays  more  for  cost 
outliers.  Despite  the  very  large  difference  in  how  day  and  cost  outliers  are 
identified,  total  charges  are  rather  similar  for  the  two  groups:  $26,151  for  day 
outliers  and  $26,012  for  cost  outliers,  as  were  estimated  costs  ($17,116  v.  $16,385) 
total.  But  PPS  paid  an  estimated  51.3  percent  of  costs  for  cost  outliers  and  37.7 
percent  of  costs  for  stay  outliers  (Table  ^.H). 

5.  PPS  paid  less  than  planned  under  the  outlier  payment  system.  As  the  198^ 
PPS  Impact  Report  noted,  the  Department's  target  for  outlier  payments  was  5J 
percent  of  the  "Federal  portion"  of  PPS  payments,  or  about  2.1  percent  of  total  PPS 
payments  (the  target  amount  was  slightly  more  than  5J  percent  of  the  Federal 
portion  because  of  delays  in  publishing  a  regulation).    Actual  payments  under  the 


outlier  system  were  about  1.1  percent  of  total  payments,  or  about  53  percent  of  the 
target.  Thresholds  were  changed  in  1985  to  correct  this  discrepancy,  and  the 
decreasing  role  of  the  hospital-specific  portion  in  payments  will  also  increase  outlier 
payments  per  case. 

6.  Outliers  are  concentrated  in  large,  urban,  and  teaching  hospitals.  Outlier 
cases,  as  well  as  the  corresponding  difference  between  charges  and  payments,  were 
concentrated  in  large,  urban,  and  teaching  hospitals  (Tables  4.14).  To  some  extent, 
this  concentration  of  high  risk  cases  is  offset  by  the  urban/rural  and  teaching 
adjustments,  but  the  random  risk  and  the  gap  between  costs  and  payments  are  a 
source  of  general  comment  and  concern.  On  the  other  hand,  the  combination  of  cost 
and  day  outliers  resulted  in  a  relatively  even  geographic  distribution  of  payments 
(Table  4.14),  with  cost  outliers  concentrated  in  the  western  part  of  the  country  and 
day  outliers  concentrated  in  the  eastern  part,  which  was  part  of  the  intent  of  the 
design. 
C.  INTERPRETATION  OF  FINDINGS. 

1.  Outliers  account  for  the  majority  of  the  difference  between  charges  and 
payments  under  PPS.  Guterman's  analysis  shows  that  this  difference  comes  from 
four  sources: 

—  Medicare  always  pays  less  than  charges.  Hospital  charges  always  exceed 
Medicare  payment  because  PPS  payments  reflect  only  allowable  operating 
costs,  and  hospitals  typicallv  charge  39  percent  more  than  their  allowable 
operating  costs.  This  "markup"  varies  across  hospital  types,  somewhat 
reducing  the  precision  of  the  findings. 

—  Omission  of  capital  and  direct  medical  education  payments.  This  increases 
the  apparent  difference  between  charges  and  payments,  especially  because 
outlier  cases  are  concentrated  in  teaching  hospitals. 
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—  PPS  never  intended  to  pay  above-average  costs  incurred  before  the  patient 
reached  the  cost  or  day  outlier  threshold. 

—  The  marginal  cost  computed  by  Medicare  appears  to  differ  from  the  actual 
increments  cost  of  care  provided  for  care  l>eyond  the  outlier  threshold. 
Guterman  was  unable  to  t>reak  down  these  four  components  precisely,  but  his 
analysis  suggested  very  strongly  that   PPS  did  not   protect   hospitals  fully 
against  the  marginal  costs  of  outliers. 

2.  Outliers  have  very  high  charges  and  very  high  losses.  Fear  that  hospitals 
would  find  a  generous  outlier  policy  profitable  shaped  the  way  the  designers  of  PPS 
crafted  the  outlier  payment  system.  However,  outlier  payments  could  probably  be 
made  more  generous  without  giving  hospitals  an  incentive  either  to  keep  patients 
past  the  outlier  threshold  or  to  recruit  outlier  cases. 

3.  Outliers  appear  to  have  more  severe  illness.  The  architects  of  PPS  judged 
that  marginal  daily  costs  of  outlier  cases  were  60  percent  of  average  daily  costs 
l)ecause  they  assumed  that  outliers  who  had  passed  the  threshold  were  like  typical 
patients  nearing  the  end  of  a  routine  stay.  The  greater  concentration  of  charges  in 
special  care  and  ancillary  services  suggests  that  outliers  not  only  stay  in  the  hospital 
longer  t)ut  are  sicker  on  average  throughout  their  stay.  This  suggests  that  the 
marginal  cost  estimates  may  l>e  significantly  lower  than  actual  marginal  cost.  For 
exceedingly  ill  patients,  especially  cost  outliers,  the  marginal  cost  may  be  further 
underestimated  because  of  the  cross-subsidies  inherent  in  per  diem  care  charges, 
which  we  discussed  in  Section  III.  On  the  other  hand,  it  is  not  possible  to  separate 
out  those  costs  which  occur  after  the  outlier  threshold  is  reached,  so  precise 
estimates  are  impractical. 

4.  PPS  identifies  stay  and  cost  outliers  in  similar  ways  txit  pays  them 
differently.  The  fact  that  stay  and  cost  outlier  threshholds  identify  patients  with 
similar  costs  suggests  equity.    Because  PPS  pays  a  substantially  larger  fraction  of 
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costs  for  cx>st  outlier  than  for  stay  outliers  may  mean  that  a  hospital  loses  a 
substantial  amount  on  the  day  that  a  patient  who  would  have  t>een  paid  as  a  cost 
outlier  crosses  the  stay  outlier  threshold  and  is  paid  as  a  stay  outlier. 

5.  PPS  paid  less  than  planned  under  the  outlier  payment  system.  PPS  paid  about 
53  percent  of  what  it  planned  for  outliers.  In  addition,  the  Department's  target  was 
based  on  the  Federal  portion,  which  will  allow  the  target  amount  to  increase 
substantially  as  the  Federal  portion  increases.  We  have  sought  to  expand  payment 
under  current  law  as  much  as  we  have  discretion  to  do. 

6.  Outliers  are  Unevenly  Distributed  Across  Hospital  Types.  This  uneven 
distribution  poses  two  problems.  First,  it  means  that  even  the  law  of  large  numbers 
will  not  protect  classes  of  hospitals  from  outlier  risks.  Second,  and  of  immediate 
practical  importance,  the  adjustments  for  urban  location  and  indirect  teaching  costs 
were  written  to  take  account  of  the  tendency  of  these  hospitals  to  have  more  than 
their  share  of  outliers.  When  outlier  payments  were  added  on  top  of  those 
adjustments,  hospitals  were  effectively  paid  twice. 

D.  THE  RELATION  OF  THE  FINDINGS  TO  OUTLIER  POLICY.  Congress  limited 
outlier  payments  to  5-6  percent  of  total  payments.  (Although  Medicare  actually 
paid  much  less  than  was  authorized,  this  was  the  result  of  dramatic  changes  in 
length  of  stay  which  could  not  have  t)een  fully  anticipated  and  of  a  decision  not  to 
apply  outlier  payments  to  the  hospital-specific  part  of  the  payment.) 
PPS  legislation  limited  outlier  payments  to  the  "marginal"  cost  of  services  provided 
after  the  patient  reached  the  "outlier  threshold."  By  the  time  the  threshold  is 
crossed,  charges  are  already  substantially  greater  than  Medicare  payments. 
Also,  any  case  which  could  t>e  paid  as  a  "day  outlier"  would  not  be  paid  as  a  "cost 
outlier."  This  meant  that  Medicare's  payment  for  a  very  expensive  case  which  would 
have  t)een  a  cost  outlier  decreased  sharply  when  that  patient  stayed  a  day  longer 
than  the  day  outlier  threshold  and  was  therefore  paid  as  a  day  outlier.    HCFA  is 
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developing  a  revised  payment  approach  for  outliers  which  is  meant  to  mitigate  this 
problem. 

Nevertheless,  the  legislative  mandate  set  the  volume  of  outlier  payments  and 
the  fact  that  they  were  to  pay  marginal  costs.  The  Guterman  results  invite  us  to 
address  seven  questions  in  light  of  the  new  data. 

1.  Does  the  Current  Outlier  Formula  Focus  Payments  Effectively?  If  outlier 
payments  are  limited  to  a  fixed  percentage  of  payments,  they  should  be  focused  in 
ways  that  best  support  the  goals  of  the  system: 

—  To  protect  hospitals  against  random  risk,  payments  should  t>e  focused  on 
exceptionally  expensive  cases  occurring  in  smaller  hospitals; 

—  To  protect  hospitals  whose  case  mix  has  expensive  features  unmeasured  by 
DRGs,  payments  should  focus  on  those  DRGs  or  conditions  where  the  greatest 
differences  in  outlier  costs  can  t>e  shown  across  hospitals  and  hospital  types; 
and 

—  To  protect  l)eneficiaries  against  discrimination,  payments  should  focus  on 
patients  who  might  t>e  identified  at  admission  as  having  an  unusually  high  risk 
of  becoming  outliers. 

Each  of  these  goals  requires  a  specific  strategy,  and  the  current  formula  provides 
little  focusing  of  payments.  For  all  three  goals  outlier  payments  might  provide 
more  efficient  protection  if  they  paid  an  increasing  fraction  of  costs  as  the  gap 
between  costs  and  DRG-based  payment  increases.  This  concept  of  increasing 
protection  with  increasing  losses  has  been  explored  in  some  detail  by  Frank  and  Lave 
(1985)  and  McGuire  et  al.  (1985)  for  psychiatric  facilities  and  appears  to  have 
potential  application  to  all  outliers. 

2.  Can  Cost  Based  Outlier  Payments  Be  Increased  Without  Compromising  PPS? 
Evidence  as  to  the  magnitude  of  hospital  losses  on  outlier  payments  and  the  overall 
cost  of  these  cases  suggests  perhaps  an  increase  could  be  prudently  made.   Medicare 
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expects  and  appears  to  be  getting  cost-conscious  behavior  from  hospitals  in 
nonoutlier  cases  with  far  less  stringent  payment  rules.  A  numt>er  of  the  Blue  Cross 
plans  and  State  Medicaid  ^ans  that  have  adopted  DRGs  for  their  Medicare 
programs,  have  used  more  liberal  outlier  policies  than  PPS. 

3.  Can  a  Severity-of-Illness  System  Replace  the  Current  Outlier  Payment 
System?  One  solution  to  outlier  payment  would  be  to  replace  it  with  a  case  mix 
system  which  deals  accurately  with  outliers,  something  which  the  present  DRG 
system  is  inherently  unable  to  do.  While  we  have  indicated  the  limitations  of 
severity  measures  with  regard  to  many  kinds  of  interhospital  differences,  such 
measures  appear  well  suited  to  outlier  cases,  where  severity  of  illness  is  clearly  a 
major  issue.  For  paying  outliers,  it  might  t>e  reasonable  to  rely  almost  exclusively 
on  a  severity-based  system  where  costs  of  collecting  special  data  would  De  very 
small  compared  to  the  payment  amounts  involved. 

However,  outliers  are  likely  to  turn  out  to  l>e  largely  patients  who  fail  to 
respond  to  therapy  and  whose  costs  are,  therefore,  not  accurately  predictable  even 
by  excellent  physiologic  measures  made  at  the  time  of  admission.  Further  study  in 
this  area  is  clearly  needed. 

4.  The  5-6  Percent  Limit  Medicare  would  need  to  spend  more  dollars  than  it  did 
in  the  first  year  of  PPS  in  order  to  provide  substantial  protection  against  the 
additional  costs  of  outlier  cases.  Available  data  is  not  conclusive.  The  picture  is 
especially  complicated  by  the  entry  of  New  York  and  Massachusetts  into  PPS 
l)ecause  these  States  are  expected  to  have  a  very  large  volume  of  outliers  and  may 
reduce  the  availability  of  outlier  payments  for  other  States. 

5.  Effect  on  Other  Adjustments.  We  have  already  pointed  out  that  PPS 
originally  paid  twice  for  outliers  in  teaching  hospitals  and  urban  hospitals—it  first 
adjusted  payments  in  a  way  which  reflected  these  hospitals'  tendency  to  have  more 
outliers  and  then  paid  extra  for  the  individual   outliers  as  they  occurred.      The 

4-46 


Omnibus  Budget  Reconciliation  Act  of  1986  corrected  this  problem  for  urban/ rural 
differences  by  adjusting  urban  and  rural  rates  separately  for  outlier  payments,  but 
the  problem  remains  for  teaching  adjustments.  Any  change  in  outlier  payment 
policy  which  increased  payments  to  teaching  hospitals  would  aggravate  this  problem 
mless  corresponding  adjustments  were  made  to  payments  for  indirect  teaching 
costs. 

6.  Do  "Marginal  Costs"  Define  Appropriate  Payments?    This  analysis  highlights 
two  problems  with  the  marginal  cost  concept: 

—  Marginal  costs  are  essentially  undefinable  unless  substantial  additional 
reporting  requirements  are  imposed  on  hospitals.  Even  if  such  reporting 
requirements  were  imposed,  estimating  the  relation  of  marginal  costs  to 
reported  charges  would  remain  very  difficult.  Recent  data  suggest  that 
marginal  costs  are  close  to  average  costs  and  that  the  effort  to  define 
marginal  costs  more  accurately  may  be  of  limited  significance. 

—  True  marginal  cost  is  the  hospital's  actual  cost  for  providing  service,  and 
paying  true  marginal  cost  means  reverting,  for  outlier  cases,  to  reimbursing 
full  costs.  Thus,  once  a  patient  had  passed  the  outlier  threshold,  the  hospital 
would  t>e  operating  under  the  same  inflationary  incentives  which  led  to  PPS  in 
the  first  place.  This  risk  is  significant  t)ecause  not  all  outlier  costs  result 
from  appropriate  care  for  which  Medicare  should  pay. 

These  arguments  suggest  a  need  for  careful  research  into  the  costs  which  hospitals 
actually  incur  in  treating  outliers.  Based  on  such  research.  Medicare  should  t>e  able 
to  craft  a  policy  which  comes  much  closer  to  paying  hospital  costs  without  drifting 
back  into  the  pre-PPS  incentives.  Such  research  should  include  careful  examination 
of  the  m£u*ginal  costs  of  very  inexpensive  cases  to  determine  whether  somewhat 
decreased  payments  for  these  cases  might  t>e  approriate  as  an  approach  to  funding 
higher  payments  for  expensive  outliers. 
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7.  Why  Do  Patients  Became  Outliers?  Guterman's  study  shows  that  the  typical 
outlier  patient  is  a  (fisproportionate  consumer  of  intensive  care  and  ancQlary 
services;  in  short,  the  average  patient  is  sicker.  But  his  analysis  does  not  show  the 
range  of  patient  conditions  and  does  not  show  whether  a  significant  minority  of 
outliers  are  seriously  but  not  acutely  ill  patients  who  could  be  cared  for  in  alternate 
settings;  payment  strategies  for  such  patients  might  appropriately  set  lower 
payment  rates.  A  second  issue  is  the  degree  to  which  outliers  are  patients  whose 
great  cost  is  predictable  at  admission  or  whether  they  largely  represent  patients 
whose  failure  to  respond  to  treatment  is  not  predictable.  Further  research  on  these 
issues  will  l>e  an  important  part  of  refining  outlier  payment  policy. 
E.  A  TYPOLOGY  OF  OUTLIERS.  This  analysis  can  be  viewed  from  a  different 
perspective  by  considering  the  relationship  of  why  cases  become  outliers  to  the 
problems  they  present  and  the  solutions  which  may  t>e  appropriate.  Outlier  cases 
can  have  a  number  of  sources  and  awareness  of  this  variety  can  clarify  the  problem 
of  outlier  payments.  (The  following  language  addresses  cost,  but  the  same  logic 
applies  to  stay  outliers.) 

1.  Qualitatively  Different  Cases  which  do  not  t)elong  with  other  cases  assigned 
to  the  DRG.  These  are  typically  relatively  uncommon  disorders  which  are  too 
infrequent  to  have  their  own  DRG  but  which  are  very  different  from  other  disorders 
in  the  same  DRG.  For  example,  acute  necrosis  of  the  liver,  a  catastrophic  but 
uncommon  disorder,  is  in  DRG  205  along  with  a  collection  of  liver  diseases  which 
are  generally  much  milder  and  much  less  expensive  to  treat. 

2.  Patients  needing  unusual  treatment  which  is  very  different  in  cost  from 
treatments  for  other  patients  in  the  DRG.  Radical  surgery  and  tone  marrow 
transplants  illustrate  this  kind  of  outlier. 

3.  Patients  with  an  unusually  severe  illness  at  the  time  of  admission  compared 
to  other  patients  in  the  same  DRG.  Severity  may  be  measured  either  directly  by  the 
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severity  of  the  problem  (a  90  percent  burn  is  worse  than  a  70  percent  burn)  or  by 
ooncirrent  problems  (an  pneumonia  is  more  serious  in  a  patient  with  AIDS  than  in  an 
otherwise  healthy  patient). 

4.  Patients  who  develop  complications  or  who  unpredictably  fail  to  respond  to 
treatment  despite  being  apparently  similar  at  admission  to  patients  who  have  an 
uncomplicated  course  in  the  hospital. 

5.  Otherwise  expensive  patients  who  become  outliers  because  they  are  treated 
in  expensive  facilities  whether  the  result  of  their  care  is  a  better  outcome  or  simply 
a  higher  bill. 

This  paradigm  has  implications  for  the  kinds  of  problems  which  can  be  expected 
and  for  how  improvements  can  be  made.  While  groups  1,  2,  and  3  may  be  expected 
to  concentrate  in  specialized  referral  facilities,  patients  in  group  4  should  t>e 
randomly  distributed  across  hospitals.  We  have  already  pointed  out  that  patients  in 
expensive  facilities  pose  problems  for  which  solutions  are  more  distant.  Thus, 
groups  1.  2,  and  3  represent  case  mix  problems  while  group  4  represents  an  insurance 
problem.  To  the  extent  that  case  mix  problems  are  associated  with  hospital  size, 
groups  1,  2,  and  3  present  special  problems  for  large  hospitals.  On  the  other  hand, 
group  4  presents  special  problems  for  small  hospitals  which  are  not  protected  from 
random  events  by  a  large  volume  of  admissions.  While  improvements  in  the 
diagnosis  coding  and  classification  might  be  most  useful  lor  group  1  and 
improvements  in  procedure  coding  and  classification  might  be  most  useful  for  group 
2,  admission  severity  measures  would  be  more  useful  for  group  3  and  an  insurance 
strategy  focused  on  very  high  cost  outliers  would  be  most  useful  for  group  4.  While 
this  typology  does  not  provide  solutions  to  any  of  the  problems,  it  does  show  why  a 
multifaceted  strategy  is  necessary. 

F.  SUMMARY.    Guterman's  study  makes  dear  that  outlier  payments  are  an  area  in 
which  major  steps   can   t>e   taken   to   bring   Medicare   payments   closer  to   costs. 
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Because  the  finite  numl)er  of  DRG  groups  makes  coping  with  outliers  an  issue, 
outliers  are  an  area  in  which  DRG  refinement  is  likely  to  l>e  successful.  Outlier 
payment  reform,  therefore,  serves  as  an  important  complement  to  DRG  refinement. 
Further  research  in  this  area  has  already  started  and  should  t>e  completed  in  time 
for  consideration  for  regulations  effective  October  1,  1987. 

Vn.  SHOULD  PPS  PAY  ACCORDING  TO  HOSPITAL  SIZE? 
We  have  noted  repeatedly  that  the  designers  of  PPS  made  a  policy  call  when 
they  decided  that  the  increased  costs  of  larger  hospitals  resulted  from  inefficiency 
and  should  not  be  recognized  in  the  payment  system.  There  was  little  empirical 
evidence  either  for  or  against  their  position.  TTiis  section  reviews  the  available 
evidence  on  this  issue  and  examines  some  of  the  alternatives  to  directly  paying  more 
for  care  in  larger  hospitals. 
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A.  EVIDENCE  REGARDING  HOSPITAL  SIZE.  Evidence  that  hospital  size  is 
associated  with  cost  per  case  is  well  established.  Every  analysis  of  HCFA  data  and 
most  other  studies  as  well  have  confirmed  the  relationship.  Section  III  pointed  out 
that  the  effects  of  hospital  size  appear  to  primarily  reflect  the  relatively  low  cost 
of  hospitals  smaller  than  a  hundred  beds  rather  than  substantial  differences  among 
larger  hospitals.  These  findings  are  not  consistent  with  the  view  of  the  architects  of 
prospective  payment  that  increasing  costs  with  increasing  size  represent 
inefficiency.  If  inefficiency  resulted  from  large  size,  we  would  not  expect  it  to 
taper  off  so  abruptly  at  100-230  beds.  Rather,  the  most  plausible  explanation  is  that 
these  smaller  hospitals  have  diminished  severity  of  illness  or  intensity  of  care. 

Thus,  the  model  of  hospital  size  in  the  usual  payment  equation  (Table  ^.lA)  is 
imprecise  because  it  implies  that  costs  continue  to  increase  as  hospital  size 
increases.  When  a  more  accurate  model  is  used  (Table  ^.ID),  hospital  size  has  little 
effect  on  the  teaching  adjustment  (compare  the  coefficients  for  teaching  in  Tables 
if. IB  and  ^. ID). 

Thorpe  et  al.'s  (1986)  work  suggests  that  DRG  weight  compression  is  associated 
with  hospital  size.  Table  ^.ID  confirins  this  by  showing  that,  when  the  equation  is 
constructed  to  recognize  the  lower  costs  of  small  hospitals,  the  evidence  of 
compression  almost  disappears  (that  is,  the  coefficient  for  case  mix  index  becomes 
close  to  1.0).  Apparently,  compression  results  in  higher  weights  for  cases  found 
largely  in  smaller  hospitals  and  lower  weights  for  cases  found  largely  in  larger 
hospitals. 

B.  ALTERNATIVE  APPROACHES.  These  technical  findings  suggest  that  even 
though  paying  hospitals  according  to  size  remains  an  undesirable  policy,  there  are 
three  strategies  which  might  result  in  more  equitable  treatment  of  hospitals  of 
different  size  without  paying  according  to  size. 
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1.  Decompressing  Weights.  Compressed  DRG  weights  increase  the  teaching 
adjustment.  Decompressing  the  weights  would  result  in  somewhat  larger  payments 
to  nonteaching  hospitals  with  a  complex  case  mix  and  somewhat  smaller  payments 
to  teaching  hospitals.   This  would  be  fairer  to  large  nonteaching  hospitals. 

2.  Increase  Outlier  Payments.  Because  outliers  concentrate  in  large  hospitals 
as  well  as  in  urban  and  teaching  hospitals,  outlier  payments  which  came  closer  to 
costs  will  result  in  fairer  payments  to  large  hospitals. 

3.  Transfer  Adjustments.  (Bobula  1986)  has  suggested  adjusting  hospital 
payments  by  increasing  payments  to  hospitals  which  receive  transfers  and 
decreasing  payments  to  those  making  transfers.  Since  transfers  overwhelmingly 
flow  from  smaller  to  larger  hospitals,  this  approach  would  also  be  fairer  to  large 
hospitals. 

VIII.  HOSPITAL-LEVEL  VARIABLES  FOR  CASE  MIX  ADJUSTMENT 
In  this  section  we  examine  the  idea  of  adjusting  payment  at  the  hospital  level  and 
focus  on  some  adjustors  which  may  become  available.    There  are  two  reasons  why 
hospital-level  adjustments  may  be  preferred  to  patient-level  adjustments: 

—characterizing  a  hospital  as  efficient  or  inefficient  or  as  having  a  simple  or 

complex  case  mix  may  be  easier  than  characterizing  individual  cases, 
—if  we  wish  to  protect  certain  hospitals  for  policy  reasons,  this  may  be  better 

achieved  by  hospital-level  adjustments  than  by  case  mix  adjustments. 
The  two  most  visible  hospital-level  adjustments  in  PPS  are  teaching  status  and 
urban/rural;  we  have  already  seen  that  PPS  specifically  does  not  recognize  hospital 
size  or  allow  the  wage  index  to  serve  as  a  proxy  for  case  mix  beyond  its  direct  fiscal 
effects  on  hospital  costs.  If  the  hospital  can  be  characterized  accurately,  then  It  Is 
possible    to   pay    some   or   all   cases   at  a  rate   which   is   related   to  costs  actually 
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incurred.  There  are  two  broad  approaches  to  characterizing  hospitals--historical 
evidence  and  analysis  of  current  data. 

PPS  uses  historical  evidence  for  the  hospital-specific  portion  of  payments  and 
for  regional  adjustments  during  the  transition  to  national  rates.  Hospitals  exempted 
or  excluded  from  PPS  are  also  paid  on  the  basis  of  historical  costs.  Nevertheless, 
historical  costs  are  most  useful  as  a  cushion  to  protect  hospitals  when  the  equity  of 
a  new  rate  system  is  uncertain  or  when  transition  rnay  be  too  rapid. 

Current  data  have  at  least  two  uses:  characterizing  certain  tracer  conditions  to 
determine  the  efficiency  of  the  hospital  and  characterizing  certain  aspects  of  case 
mix  in  order  to  make  inferences  about  overall  case  mix. 

A.  TRACER  CONDITIONS.  A  tracer  is  a  condition  which  can  be  precisely  and 
homogeneously  defined  so  that  different  hospitals  can  be  characterized  by 
differences  in  the  way  they  handle  this  standardized  condition.  Kessner  (1973) 
pioneered  this  method  in  studying  common  nonhospital  illnesses,  but  List  et  al., 
(1983)  showed  that  variations  in  the  severity  of  myocardial  infarction  could  be 
controlled  so  as  to  use  that  disease  as  a  tracer.  Cromwell  et  al.'s  (1985)  work  uses  a 
variety  of  "tracers"  which  are  generally  less  sharply  defined.  In  principle,  tracers 
can  be  used  to  define  the  relative  efficiency  of  different  hospitals,  and  this  relative 
efficiency  could  then  be  as  a  basis  for  either  hospital-specific  or  hospital  type 
adjustments  of  payments.  The  difficulty  with  such  methods  is  that  cross- 
subsidization  is  so  large  for  some  conditions  in  some  kinds  of  hospitals  that  costs  for 
individual  admissions  cannot  be  accurately  defined.  This  strategy  will,  therefore, 
require  significant  further  development  before  it  is  fully  satisfactory  even  as  a 
research  strategy. 

3.  TRANSFERS.  We  have  already  described  work  at  HCFA  by  (Bobula  1986)  that 
examines  counting  patient  transfers  as  a  tool  for  characterizing  hospitals.    They  find 
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that  hospitals  receiving  more  transfers  have  higher  average  costs  than  the  typical 
hospital,  even  after  correcting  for  case  rnix,  wage  index,  location,  and  teaching 
activity.  Conversely,  hospitals  sending  more  transfers  have  significantly  lower  than 
average  costs  per  case.  Measurement  strategies  of  this  kind  may  permit 
significantly  greater  accuracy  in  characterizing  hospitals  for  payment  purposes,  but 
the  policy  implications  of  such  measures  present  significant  difficulties:  in  effect,  a 
hospital  might  be  paid  quite  heavily  not  to  transfer  patients.  A  related,  and  perhaps 
less  manipulable,  measure  might  be  the  distance  which  patients  come  to  the  hospital 
compared  to  nearby  hospitals. 

A  very  different  way  of  characterizing  hospitals  is  implicit  in  the  finding 
tertiary  and  nontertiary  hospitals  have  very  different  cost  patterns.  If  this  finding 
proves  to  be  robust,  patterns  of  costs  may  provide  ways  to  characterize  individual 
hospitals  more  accurately. 

IX.  THE  ADEQUACY  OF  CURRENT  CASE  MIX  ADJUSTMENT  FOR  NATIONAL  RATES 
The  phased  implementation  of  national  PPS  payment  rates  from  a  hospital- 
specific  base  provided  both  an  opportunity  to  develop  more  refined  case  measures 
and  time  to  determine  the  desirability  and  feasibility  of  moving  to  national  rates. 
The  foregoing  analysis  has  made  clear  that  major  refinements  of  the  case  mix 
adjustment  system  are  not  ready,  which  places  extra  emphasis  on  the  adequacy  of 
the  current  system  for  national  payment  rates. 

A.  DECREASING  MARGIN  FOR  ERROR  IN  PPS  RATE  SETTING.  During  the  first 
years  of  PPS,  most  hospitals  were  protected  from  possible  payment  imprecision 
under  national  rates  not  only  by  the  blend  of  regional  and  hospital-specific  rates  but 
also  by  high  payments  relative  to  historical  costs  and  substantial  gains  in  efficiency. 
In  the  immediate  future,  hospitals  will  be  pressed  not  only  by  the  removal  of  these 


protections  but  also  by  an  increasingly  competitive  health  care  environment, 
problems  in  financing  uncompensated  care,  and  a  Federal  budget  which  is 
increasingly  constrained  by  "Gramm-Rudman-Hollings*'  (the  Balanced  Budget  and 
Emergency  Deficit  Control  Act  of  I9S5:  P.L.  99-177).  Hospitals  will  have  fewer 
And  fewer  alternative  sources  of  income  to  which  they  can  turn  if  Medicare's 
payments  are  too  low.  On  the  other  hand,  Medicare  currently  provides  average 
margins  in  excess  of  12  percent,  which  leaves  substantial  protection  against  any 
error. 

B.  THE  LIMITATIONS  OF  THE  CURRENT  CASE  MIX  ADJUSTMENT  SYSTEM. 
Differences  in  hospital  costs  appear  to  stem  from  a  variety  of  factors  which  are  not 
reflected  in  PPS.  We  have  reviewed  studies  which  suggest  that  there  are  significant 
differences  in: 

—  severity   of   illness   within   DRGs,  not   only    among  patients  but   also  among 
hospitals, 

—  the  way  hospitalization  is  used,  not  only  among  patients  and  hospitals  but  also 
across  States  and  perhaps  regions, 

■»- local  standards  for  appropriate  intensity  of  care,  among  hospitals,  types  of 
hospitals,  and  States,  and 

—  costs,  resulting  from  all  of  these  factors  and  also  from  patterns  of  referral, 
kinds  of  admissions,  and  specialization  in  certain  kinds  of  disease. 

C.  STRATEGY  FOR  NATIONAL  RATES.  Because  we  do  not  have  satisfactory 
measures  of  severity  of  illness  or  satisfactory  ways  to  assess  what  intensity  of  care 
is  medically  appropriate,  we  cannot  resolve  the  question  of  which  differences 
between  costs  and  payments  are  appropriate.  Clearly,  perfect  correlation  between 
costs  and  payments  would  be  equivalent  to  cost-based  reimbursement;  because  there 
still  appears  to  be  substantial  room  for  increasing  efficiency  in  the  hospital  industry, 


payments  ^vhich  perfectly  reflect  costs  would  not  be  a  desirable  goal.  On  the  other 
hand,  we  cannot  reach  a  technical  judgment  whether  the  degree  of  match  between 
costs  and  payments  which  we  have  found,  and  which  we  project  to  occur  under 
national  rates,  is  the  correct  one.  The  difference  between  costs  and  payments  may 
reflect  correctable  inefficiencies  for  which  Medicare  should  not  pay;  if  this  is  true, 
then  the  high  operating  margins  realized  by  some  hospitals  are  useful  incentives  to 
other  hospitals  to  become  more  efficient.  On  the  other  hand,  the  differences 
between  costs  and  payments  may  mean  that  hospitals  with  severely  ill  patients  and 
medically  appropriate  intense  care  are  being  penalized  by  a  system  which  does  not 
fully  measure  their  special  responsibilities.  Current  technology  does  not  allow  us  to 
deteriTiine  which  of  these  two  scenarios  is  most  correct. 

Under  these  circumstances,  close  monitoring  of  the  impact  of  PPS  on  providers 
and  patients  becomes  especially  important,  and  close  cooperation  between  Congress 
and  the  Administration  in  charting  the  future  course  of  the  program  will  be 
essential.  The  refineinents  of  case  inix  measurernent  which  could  improve  payment 
of  severity  and  complexity  are  also  the  research  tools  necessary  to  determine 
whether  PPS  payments  are  equitable;  further  refinement  of  case  mix  adjustment 
methods  is  an  integral  part  of  the  future  of  PPS.  The  next  chapter  describes  an 
agenda  for  work  in  this  area. 

D.  SUMMARY.  We  have  examined  the  reasons  why  intensity  of  care  differs  across 
hospitals.  We  found  that  the  best  current  estimates  indicate  that  the  impact  of  PPS 
has  been  somewhat  different  than  that  which  was  predicted  and  raised  some 
question  that  the  current  case  mix  adjustment  system  is  sufficiently  precise  for 
national  rates.  We  identified  the  outlier  payment  system  as  the  area  in  which 
greatest  progress  can  be  made  most  rapidly  to  bring  PPS  payments  into  closer 
align  nent    with    hospital    costs.      Finally,   we    suggested    ways    in    which    case    mix 
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measurement    could     make    greater     use    of    hospital-level    adjustments    without 
compromising  the  basic  incentives  of  PPS. 
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Table  4.1 


PREDICTORS  OF  HOSPITAL  AVERAGE  COST  PER  CASE 
Percentage  Increase  for  Every  1.0  Percent  Increase  in  Variable 


1984  Costs 


0.954 

0.583 

0.959 

9.91  ) percent 

3.78  ) higher 

2.54  )than  rural 

0.086 


1981  Costs 

A:  Basic  IVbdel 

Case  Mix  Index 

1.014 

Teaching* 

0.579 

Wage  Index 

1.019 

Large  SMSA 

11.60 

Medium  SMSA 

2.76 

Small  SMSA 

0.10 

Number  of  Beds 

0.119 

R^ 


0.725 


0.702 


B:  Without  Hospital  Size  (beds) 


Case  Mix  Index 

1.464 

Teaching* 

0.845 

Wage  Index 

0.980 

Large  SMSA 

23.18 

Medium  SMSA 

11.96 

Small  SMSA 

9.07 

1.295 

0.766 

0.863 

19.97  ) percent 
11.34  ) higher 

9.35  )than  rural 


R^ 


0.692 


0.682 


C:  Coefficients  Reflecting  Aictual  1984  Payment  Procedures 


Case  Mix  Index 
Teaching* 
Wage  Index 
Urban 


1.000-" 
1.158^ 
0.7915^ 
25.34 


R^ 

1 


0.666 
Value  mandated  by  law  or  regulation. 


1.000^ 
1.158^ 
0.7915^ 
18.70  percent  higher 
than  rural 
0.669 


*  Teaching  is  defined  as  the  ratio  of  interns  and  residents  to  hospital 
beds. 

1981  (N=4971):  hospitals  with  >49  Medicare  discharges  in  1981  and  any  1984 
discharges. 

1984  (N=5349):  hospitals  with  >49  Medicare  discharges  and  with  available 
Cost  Reports. 
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Table  4.1 
(continued) 

PREDICTORS  OF  HOSPIT?\L  AVERAGE  COST  PER  CASE 
Percentage  increase  for  Every  1.0  Percent  Increase  in  Variable 


1981  Costs 


1984  Costs 


D:  With  Hospital  Bed  Size  Groups 


Case  Mix  Index 

1.097 

Teaching* 

0.742 

Wage  Index 

1.001 

Large  SMSA 

13.21 

Medium  SMSA 

3.91 

Small  SMSA 

1.04 

51-100  Beds 

14.11 

101-250  Beds 

25.24 

251-500  Beds 

29.55 

More  than  500  beds 

32.81 

R^ 


0.725 


1.003 

0.702 

0.943 
10.57  ))  percent 

4.42  ))  higher 

2.87  ))  than  rural 

9.10)  percQit 
18.27)  higher  than 
21.07)  1-50  bed 
21.15)  hospitals 

0.702 


Case  Mix  Index 
r2 


2.851 
0.401 


2.298 
0.423 


F; 


Case  Mix  Index 

1.469 

Teaching* 

0.899 

Wage  Index 

1.436 

r^ 


0.670 


1.376 
0.846 
1.191 

0.669 


*  Teaching  is  defined  as  the  ratio  of  interns  and  residents  to  hospital 
beds. 

1981  (N=4971):  hospitals  with  >49  Medicare  discharges  in  1981  and  any  1984 
discharges. 

1984  {N=5349):  hospitals  with  >49  Medicare  discharges  and  with  available 
Cost  Reports. 


Source:  Medicare  Cost  Reports. 


Table 

4. 

2 

A/ERAGE  COST 

PER 

Iv|h;i)ICARE  CASE  IN 

Effect^ 

Cumulative^  R^ 

0.971 

0.487 

1.275 

0.641 

0.686 

0.661 

11.068 

0.672 

0.719 

0.674 

Variable 

Wage  Index 
Case  Mix  Index 
Teaching  Activity 
Urban  location 
Disproportionate  share 

Including  Outlier  Payinents 

Wage  Index  0.891  0.487 

Case  Mix  Index  1.253  0.641 

Teaching  Activity  0.336  0.661 

Urban  location  6.716  0.672 

Disproportionate  share  0.133  0.674 

Outlier  payments^  2.571  0.737 

Controlling  for  Hospital  Size 


Wage  Index 

1.031 

0.482 

Case  Mix  Index 

0.951 

0.641 

Teaching  Activity 

0.618 

0.661 

Urban  location 

3.821 

0.672 

Disproportionate  share 

0.497 

0.674 

50-99  beds 

9.771 

100-149  beds 

17.626 

150-199  beds 

18.338 

0.696 

200-399  beds 

22.308 

more  than  399  beds 

25.028 

Effects  correspond  to  coefficients  in  a  log-linear  regression  irolel 
which  includes  all  of  the  variables.  For  continuous  variables  (wage 
index,  case  mix  index,  teaching  activity,  disproportionate  share,  and 
outlier  payments)  the  effect  is  the  percent  increase  in  cost  corresponding 
to  a  1.0  percent  increase  in  the  variable.  For  dummy  variables  (urban 
location  and  bed  size)  the  effect  is  the  percent  increase  in  cost  per  case 
i:^  hospital  is  urban  or  if  it  has  the  indicated  number  of  beds,  conpared 
to  a  rural  hospital  or  to  one  with  1-49  beds. 

7   .    .    ?  .         .  ... 

Cumulative  R  is  the  portion  of  the  total  variance  m  hospital 

average  cost  per  case  which  is  accounted  for  in  a  step^^ise  regression  by 

variables  down  to  the  given  value.  All  hospital  size  dummies  are  entered 

together. 

^     .       .  .     .  .       . 

The  disproportionate  share  variable  is  the  percentage  increase  m 

payments  to  the  hospital  which  is  defined  by  legislation;  the  coefficient 
indicates  the  percent  increase  in  cost  which  corresponds  to  a  1.0  percent 
increase  in  payments  required  by  law. 

Outlier  payments  are  the  fraction  of  payments  for  the  hospital  made  to 
the  hospital  estimated  frcm  casemix  during  the  first  year  of  PPS  and 
payment  rules  for  Federal  Fiscal  Year  1986  but  assuming  full 
implementation  of  PPS. 

Source:  Medicare  Cost  Reports.  N=5350  hospitals. 


Table  4.3 

DEATHS  DURING  TVrMISSIQN  AND  WITHTO  30  DAYS  ?\FTER  DISCHARGE 
5  Percent  Saiiple  of  1984  Medicare  Discharges 


Total 

T^dms 

Oregon 

5995 

Utah 

1897 

Wyoming 

725 

Washington 

8292 

Montana 

1802 

Iowa 

7573 

Kansas 

6466 

Minnesota 

9503 

Nevaria 

1691 

Arkansas 

7385 

S  Dakota 

2214 

Wisconsin 

11278 

T^izona 

5946 

Oklahoma 

7779 

N  IVfexico 

2246 

Colorado 

4692 

Nebraska 

4491 

Georgia 

13633 

Missouri 

15294 

Kentucky 

9256 

Texas 

33071 

N  Dakota 

2260 

California 

44630 

Louisianna 

9814 

Idaho 

1807 

Tennessee 

14757 

Indiana 

12871 

Vermont 

784 

Alablam 

10603 

Mississippi  7741 

Ohio 

21572 

Delaware 

1201 

Florida 

34209 

Hawaii 

1156 

N  Carolina 

12150 

Illinois 

27084 

S  Carolina 

6282 

Penna. 

34868 

Michigan 

19938 

New  Hamp. 

1363 

Virginia 

10678 

Maine 

3076 

.ths  Within 

Deaths  During 

30  Days 

Admission 

8.4% 

4  9% 

8.8% 

5.6% 

7.9% 

4.8% 

8.0% 

5.3% 

8.2% 

5.7% 

7.4% 

5.0% 

7.5% 

5.1% 

7.6% 

5.2% 

8.6% 

6.3% 

8.3% 

5.9% 

6.0% 

3.7% 

7.4% 

5.2% 

7.7% 

5.4% 

8.5% 

6.3% 

6.9% 

4.7% 

7.1% 

4.9% 

7.7% 

5.6% 

7.4% 

5.4% 

7.6% 

5.5% 

7.5% 

5.5% 

7.5% 

5.5% 

6.3% 

4.4% 

7.6% 

5.7% 

7.4% 

5.5% 

6.8% 

4.9% 

7.0% 

5.2% 

7.6% 

5.8% 

7.9% 

6.1% 

6.6% 

4.8% 

6.8% 

5.1% 

7.6% 

5.9% 

8.7% 

7.0% 

7.3% 

5.6% 

8.0% 

6.3% 

8.2% 

6.6% 

7.5% 

5.9% 

7.2% 

5.8% 

7.3% 

5.9% 

7.3% 

5.9% 

6.9% 

5.5% 

7.5% 

6.1% 

7.3% 

6.0% 

Average  Length 
Difference   of  Stay 


3.5% 
3.2% 
3.0% 
2.7% 
2.6% 
2.4% 
2.4% 
2.4% 
2.4% 
2.4% 
2.2% 
2.2% 
2.2% 
2.2% 

^  •  ^  O 


2% 
1% 
0% 
0% 
2.0% 
2.0% 
1.9% 
1.9% 
1.9% 
1.8% 
1.8% 
1.8% 
1.8% 
1.8% 
1.8% 
1.8% 
1.7% 
1.7% 
1.6% 
1.6% 
1.5% 
1.5% 
1.4% 
1.4% 
1.4% 


3% 
3% 


6.51 
6.64 
7.52 
6.86 
7.00 
8.12 
8.67 
7.69 
7.81 
7.30 
7.45 
8.35 
7.92 
7.75 
7.32 
7.84 
7.94 
7.60 
8.94 
8.46 
8.14 
7.79 
7.71 
7.85 
6.50 
8.65 
8.89 
8.93 
8.15 
7.76 
9.32 

10.41 
8.42 
8.74 
8.83 
9.72 
8.84 

10.03 
9.31 
8.61 
9.28 
8.44 


(continued  on  next  page) 


TABLE  4.3 
(Continued) 

DEATHS  DURIM3  ?J3y[rSSICaNr  AND  WITHIN  30  DAYS  AFTER  DISCHARGE 
5  Percent  Sanple  of  1984  iVfedicare  Discharges 


TOmL 

DEATHS  WITHIN 

DEATHS  DURING 

AVERAGE  U 

T^IJMS 

30  DAYS 

^JMISSICN 

Dli^'i'EkhNCE   OF  STA^ 

W.  Virginia  5769 

6.7% 

5.4% 

1.2% 

8.17 

Rhode  Is.    2193 

7.7% 

6.6% 

1.1% 

10.33 

Connecticut  6392 

8.1% 

7.1% 

1.0% 

9.84 

Maryland    6107 

7.4% 

6.7% 

0.8% 

10.33 

Dist.  Col.   1293 

6.7% 

6.0% 

0.7% 

12.20 

Mass.      14560 

7.2% 

6.7% 

0.5% 

11.58 

Alaska      226 

4.0% 

3.5% 

0.4% 

7.96 

New  Jersey  15837 

7.7% 

7.6% 

0.1% 

12.11 

New  York   37973 

7.8% 

8.6% 

-0.8% 

13.82 

U.S.      530423  7.5%  5.9%  1.6%        9.11 


Source:  Eggers  P.  Unpublished  Data. 


Table  4.4 

Hospital  T^dmission  and  Surgery  Rates 

Selected  DRG  Clusters  and  Procedures 

Per  1000  Medicare  Beneficiaries 

1982-1983  Medicare  Data 


Michigan 

New 
Jersey 

NbrtH 
Carolina 

Washington 

Cerebrovascular 
Disease 

T^dmissions 
^  Extracranial 

Vascular  Procs 

15.6 
1.0 

16.3 
0.8 

16.1 
0.9 

12.5 
2.0 

Coronary  Artery 
Disease 

?yamissions 

Coronary  Artery 
Bypass  Grafts 

Cardiac  Catheter- 
izations 

19.7 
0.9 
2.2 

16.2 
0.4 
2.0 

22.5 
0.8 
2.8 

13.5 
2.2 
3.1 

Pneumonia 

Admissions 

7.5 

6.8 

9.5 

6.9 

Prostate  Disease 


Admissions 

7.4 

5.9 

7.0 

6.3 

Prostatectcanies 

5.6 

4.6 

5.0 

5.5 

Admissions  without 

Prostatectomy 

1.8 

1.3 

2.0 

0.8 

Source:  Mitchell  et  al.  1986 


Table  4.5 

Death  Rate  Per  100  ?wamissions  By  State 

By  DRG  Within  DRG  Cluster 

1982-3  Medicare  Data 


Cerebrovascular 

New 

North 

Disease 

Michigan 

Jersey 

Carolina 

Washington 

DRG  5 

1.2 

4.1 

1.7 

2.9 

DRG  14 

17.1 

20.2 

16.3 

15.3 

DRG  15 

1.1 

2.0 

0.9 

1.1 

Coronary  Z^rtery 

Disease 

DRG  106 

7.4 

9.1 

5.1 

4.2 

DRG  125 

2.9 

10.3 

4.3 

4.0 

DRG  132 

6.0 

7.5 

6.4 

5.9 

DRG  140 

0.9 

1.1 

0.6 

2.2 

Pneumonia 

DRG  79 

22.8 

25.9 

15.2 

16.8 

DRG  80 

16.3 

11.8 

2.7 

4.3 

DRG  89 

13.1 

15.2 

12.6 

9.1 

DRG  90 

5.9 

4.3 

1.0 

4.2 

Prostate  Disease 


DRG  306 
DRG  336 
DRG  346 
DRG  348 


1.1 

1.4 

1.5 

0.8 

0.5 

0.9 

0.6 

0.2 

5.4 

22.2 

15.4 

15.1 

1.2 

2.6 

1.4 

0.0 

Source:  Mitchell  et  al.  1986 


Table  4.6 

Effect  of  Decoirpressing  DRG  Weights 

With  and  Without  Inclusion  of  Hospital  Bedsize 

Case  Mix  Index  Coefficient  Constrained  to  Reflect  Payment  Rules 

Rand  1981  Synthetic  Costs 


IVDDEL  A 


"Conpressed" 


"Deconpressed" 


Case  Mix  Index 
Teaching 
Wage  Index 
SMSA 


R^ 


10.00 
7.85 
9.03 

15.65 


0.590 


10.00 
6.89 
8.63 
13.60  percent 

higher  than 
rural 
0.596 


MDDET,  B 

"Coitpressed" 

Case  Mix  Index 

10.00 

Teaching 

3.55 

Wage  Index 

10.26 

SMSA 

8.35 

Hospital  Beds 

9.45 

Regional  Differences 

-3.3  to  +10.7 

"Deconpressed" 

10.00 
3.52 
10.29 
2.93 
9.06 
-2.9  to  +10.9 
percent 
corrpared  to 
pacific  region 


R^ 


0.633 


0.633 


See  text. 
Source:  Thorpe  et  al  1986 


Table  4.7 

Average  Operating  Margins  on  Medicare  Cases  by  Class  of  Hospital 
First  Hospital  Fiscal  Years  on  Prospective  Paynent 

Number  of      Operating 
Hospital  Type  Hospitals      Margin  % 


National  Average  4544  16.4 


Urban 

2217 

18.2 

1-99  beds 

545 

14.2 

100-404  beds 

1347 

16.9 

405-684 

277 

20.0 

inore  than  684  beds 

48 

23.0 

Rural 

2327 

8.6 

1-49  beds 

1074 

6.0 

1-99  beds 

1795 

8.3 

100-169  beds 

357 

8.4 

more  than  169  beds 

175 

9.3 

Teaching  status 

Nbnteaching 

3762 

13.4 

Light  {<.25  res/bed) 

660 

19.4 

Heavy  (>.25  res/bed) 

122 

23.3 

Disproportionate  share 

Non-DSH 

3636 

15.6 

Urban  DSH  >100  beds 

670 

18.7 

Urban  DSH  <100  beds 

59 

11.7 

Rural  DSH 

179 

8.7 

Sole  Community/Rural  Referral 

Nbn-SCH/RRC 

4082 

16.9 

Sole  Community 

289 

7.0 

Rural  Referral  Center 

143 

10.2 

Both  SCH  and  RRC 

16 

9.9 

Ownership 

Voluntary 

2700 

17.4 

Proprietary 

649 

14.3 

Government 

1178 

13.2 

Table  4.8 
IVIedicare  Operating  Margin  By  Urban/Rural  Location  and  Number  of  Beds 
Actual  First  Year  of  PPS  and  Projected  Experience  under  Full  National  Rates 


Percent 

of  Hospitals 

Nutnber 

Loss 

Profit 

of 

>20% 

20-10% 

10-5% 

5-0% 

0-5% 

5- 

-10% 

10 

-20%  20-30% 

>30% 

Hosps 

Urban 

PPS  yr  1 

4 

4 

5 

6 

11 

11 

28 

19 

13 

545 

<100 

beds 

National 
rates 

6 

5 

5 

6 

8 

7 

14 

10 

40 

675 

Urban 

PPS  yr  1 

1 

1 

1 

4 

8 

13 

35 

25 

12 

1347 

100- 

404 

National 

3 

6 

5 

9 

10 

10 

20 

14 

21 

1649 

beds 

rates 

Urban 

PPS  yr  1 

0 

0 

0 

3 

5 

13 

32 

29 

18 

277 

405- 

684 

National 

0 

8 

5 

8 

11 

11 

21 

17 

20 

332 

beds 

rates 

Urban 

PPS  yr  1 

0 

0 

0 

0 

1 

3 

15 

16 

13 

48 

>684 

beds 

National 
rates 

0 

10 

7 

3 

11 

11 

21 

17 

20 

70 

Rural 

PPS  yr  1 

6 

8 

7 

11 

12 

12 

26 

12 

7 

1795 

<100 

beds 

National 
rates 

4 

7 

5 

6 

8 

9 

18 

14 

30 

2029 

Rural 

PPS  yr  1 

1 

4 

7 

10 

17 

17 

31 

11 

3 

357 

100- 

169 

National 

4 

10 

6 

11 

11 

8 

20 

15 

13 

397 

beds 

Rural 

PPS  yr  1 

1 

3 

4 

10 

16 

22 

31 

10 

3 

175 

>169 

beds 

National 

4 

7 

5 

7 

11 

10 

26 

14 

17 

197 

All 

PPS  yr  1 

3.0 

4.4 

4.5 

7.6 

10. 

8 

12. 

6 

29.9 

17.7 

9.5 

4544 

hosps 

National 

3.7 

6.9 

5.2 

7.5 

9. 

0 

9. 

4 

18.6 

13.8 

26.0 

5349 

Table  4.9 
IVtedicare  Operating  Margins  By  Urban/Rural  Location  and  Teaching  Status 
Actual  First  Year  of  PPS  and  Projected  Experience  under  Full  National  Rates 


P 

ercent  oj 

f  Hosp 

itals 

- 

Number 

Loss 

Profit 

of 

>20% 

20-10% 

10- 

-5% 

5-0%  0- 

-5%  5- 

10%  10- 

-20%  20 

-30% 

>30% 

Hosps 

Urban 

PPS  yr  1 

2 

2 

3 

5 

10 

13 

32 

22 

11 

1491 

non- 

teach- 

National 

4 

6 

6 

8 

10 

9 

18 

13 

27 

1823 

ing 

Urban 

PPS  yr  1 

0 

1 

1 

3 

6 

12 

27 

37 

15 

608 

light 
teach- 

National 

2 

7 

6 

9 

9 

12 

20 

13 

21 

744 

ing 

Urban 

PPS  yr  1 

0 

1 

0 

3 

4 

3 

27 

30 

31 

118 

heavy- 
teach- 

National 

2 

6 

5 

6 

8 

7 

21 

12 

33 

159 

ing 

Rural 

PPS  yr  1 

5 

7 

7 

11 

13 

13 

27 

12 

6 

2271 

non- 

teach- 

National 

4 

7 

5 

7 

9 

9 

19 

14 

26 

2556 

ing 

Rural 

PPS  yr  1 

2 

4 

2 

2 

17 

15 

37 

13 

8 

52 

light 
teach- 

National 

3 

10 

8 

2 

8 

5 

27 

20 

17 

60 

ing 

Rural 

PPS  yr  1 

0 

0 

0 

0 

0 

25 

0 

25 

50 

4 

heavy 
teach- 

National 

0 

14 

0 

0 

0 

14 

0 

14 

57 

7 

ing 

All 

PPS  yr  1 

3.0 

4.4 

4 

.5 

7.6 

10.8 

12.6 

29.9 

17.7 

9.5 

4544 

hosps 

National 

3.7 

6. 

9 

5. 

2   7.5 

9.0 

9.4 

18.6 

13. < 

3  26.0 

5349 

Table  4.10 

Predictors  of  Hospital  Medicare  Operating  Margins 
with  and  without  control  for  outlier  payments 


Variable  PPS  Year  1^ 


National  Rates 


Wage  Index  -0.054 

Case  Mix  Index  0.370 

Teaching  Intensity  0.107 

Urban  5 .  232 

Disproportionate  share  0.140 

Less  than  50  beds  0.00 

50-99  Beds  2.498 

100-149  beds  1.113 

150-199  beds  0.476 

200-399  beds  0.501 

IVbre  than  399  beds  0.728 


** 
**** 


**** 


-0.244 

0.032 

0.194 

10.950 

0.492 

0.00 

-6.838 

-10.969 

-9.875 

-12.380 

-13.552 


**** 


**** 
**** 


**** 
**** 


**** 


Adjusted  R^ 


0.151 


Wage  Index 

-0.043 

Case  Mix  Index 

0.384 

Teaching  Intensity 

0.119 

Urban 

5.364 

Disproportionate  share 

0.235 

Less  than  50  beds 

0.00 

50-99  Beds 

2.882 

100-149  beds 

1.633 

150-199  beds 

1.226 

200-399  beds 

1.362 

MDre  than  399  beds 

0.541 

Outliers 

-0.507 

*j<c** 


**** 


0.057 

-0.201 

-0.027 

0.287 

10.191 

0.644 

0.00 

-5.679' 

-9.197 

-7.645 

-9.279 

-8.713 

-0.973 


icickie 


**** 
**** 


**** 


:kickic 


Adjusted  R^ 

Number  of  Hospitals 


0.158 


4543 


0.084 


5313 


*=  p  <.05;  **  =  p<.01;  ***  =  p<.001;  ****  =  p<.0001 

Effects  correspond  to  coefficients  in  a  log-linear  regression  model 
which  includes  all  of  the  variables.  For  continuous  variables  (wage 
index,  case  mix  index,  teaching  activity,  disproportionate  share,  and 
outlier  payments)  the  effect  is  the  percent  increase  in  cost  corresponding 
to  a  1.0  percent  increase  in  the  variable.  For  durmy  variables  (urban 
location  and  bed  size)  the  effect  is  the  percent  increase  in  cost  per  case 
if  hospital  is  urban  or  if  it  has  the  indicated  number  of  beds,  conpared 
to  a  rural  hospital  or  to  one  with  1-49  beds. 

7       .         .  ...  .     . 

The  disproportionate  share  variable  is  defined  by  legislation;  the 
coefficient  indicates  the  percent  increase  in  margin  which  corresponds  to 
a  1.0  percent  increase  in  payments  required  by  law. 

■^        .  .  .  ... 

The  coefficient  for  outliers  reflects  the  change  m  operating  margin 

corresponding  to  a  1.0  percent  increase  in  the  percentage  of  a  hospital's 
payments  v^^ich  is  received  from  outliers. 


Source:  Medicare  Cost  Reports. 


Table  4.11 

Variations  in  Hospital  IVtedicare  Operating  Margins  Across  Regions 
Controlling  for  other  hospital  characteristics 


Variable 


PPS  Year  1' 


National  Rates 


New  Ehgland 
Mid  Atlantic 
South  Atlantic  ex  FL 
South  Atlantic:  FL 
East  N  Cent:  XL  &  MI 
East  N  Cent:  IN,  OH,  WI 
East  S  Cent:  MS 
East  S  Cent:  KY,  IN,  AL 
West  N  Cent:  NB,  ND, 
West  N  Cent:  10,  KA, 
West  S  Cent:  TX 
West  S  Cent:  7^,  LA, 
Mountain:  UT 
Mountain:  M,  NM,  NV 
Mountain:  ID,  MT,  CO,  WY 
Pacific:  CA 
Pacific:  7^,  HI 
Pacific:  OR,  WA 

Wage  Index 

Case  Mix  Index 

Teaching  Intensity 

Urban 

Disproportionate  share 

Less  than  50  beds 

50-99  Beds 

100-149  beds 

150-199  beds 

200-399  beds 

MDre  than  399  beds 


SD 
IVD,  MN 

OK 


0.00 

+5.43 

+1.02 

-0.10 

-4.89 

3.63 

3.43' 

-0.16 

2.38 

1.47 


**** 


08 
09 
42 
81 
52 
77 
81 


** 


3.14 


-0.084 
0.382 
0.097 
6.097 
0.173 
0.00 
2.498 
1.113 
0.476 
0.501 
0.728 


**** 


0.00 

i 

+7.71 

i 

+9.55 
-0.59 
-1.41 
6.56' 
14.48 
10.80 
16.36' 
9.07^ 
7.71^ 
11.11 
14.53 
-1.03 
10.62 
3.93' 
7.37. 
16.35 


it*** 


**** 


**** 


-0.147 

-0.011 

0.182 

9.359 

0.479 

0.00 

-5.887' 

-10.841 

-9.134 

-12.012 

-13.669 


*** 


*** 

**** 

**** 


.**** 
**** 


**** 
**** 


**** 


Adjusted  R^ 


*= 


0.163 


p  <.05:  **  =  p<.01;  ***  =  p<.001;  ****  =  p<.0001 


0.091 


For  notes  see  Table  4.10.  Coefficients  for  regions  are  relative  increase  or 
decrease  in  operating  margin  coipared  to  a  otherwise  similar  hospital  in  the 
New  England  region. 


Source:  Medicare  Cost  Reports. 
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Table  4.13 


Characteristics  of  Medicare  Outlier  Cases 
Cases  Paid  Under  PPS  in  Federal  Fiscal  Year  1984 


Tvte 

Of  Outli 

er 

Non- 

All 

Length  of 

PPS 

Variable 

Stay 

Cost 

All 

Outliers 

Cases 

PPS  Discharges 

1.76% 

1.21% 

2.96% 

97.04% 

100.00% 

Urban  Hospitals 

2.07 

1.45 

3.52 

96.48 

100.00 

Rural  Hospitals 

0.98 

0.60 

1.58 

98.42 

100.00 

Teaching  Hospitals 

2.58 

1.24 

3.82 

96.18 

100.00 

Nonteaching  Hospitals 

1.34 

1.19 

2.53 

97.47 

100.00 

Hospital  Days 

9.17 

3.34 

12.51 

87.49 

100.00 

Total  Charges 

10.49 

7.17 

17.66 

82.36 

100.00 

Est.  Operating  Costs 

10.38 

6.80 

17.19 

82.81 

100 . 00 

Medicare  Payments 

3.35 

3.00 

6.35 

93.65 

100.00 

Charges  less  Payments 

34.71% 

21.32 

56.02 

43.98 

100.00 

Special  care  charges 
(percent  of  total) 

Ancillary  charges 

(percent  of  total) 


10.10 


62.12 


14.20 


63.89 


11.76 


62.83 


6.36 


55.68 


7.32 


56.48 


Length  of  Stay 
Total  Charge 
Charge  per  Day 
Est.  (Operating  Cost 
Medicare  payment 
Est.  Operating  Margin 


40.9  21.7  33.1 

$  26,151  $26,012  $26,094 

$    639  $1,200  $788 

$  17,116  $16,385  $16,819 

$  6,457  $8,399  $7,248 

$-10,658  $-7,989  $-9,573 


7.1  7.8 

$  3,714  $  4,377 

$  526  $   559 

$2,473  $2,898 

$3,262  $3,380 

$+791  $+484 


Source:     Stuart  Guterman,  Health  Care  Financing  A±ninistration, 
unpjblished  data. 


Table  4.14 

DISTEtrounCN  OF  OUTLIER  CASES  BY  HOSPITAL  GROUP 

PPS  Discharges  in  Federal  Fiscal  Year  1984 


Hospital  Group 
All  Hospitals 


Outliers  as  a  Percentage  of  PPS  Cases 
Length  of  Stay       Cost  Total 

Outliers       Outliers  Outliers 


1.76% 


1.21% 


2.96% 


Urban 

2.07 

o  <100  beds 

0.93 

o  100-404  beds 

1.78 

O  405-684  beds 

2.59 

o  685+  beds 

3.04 

Rural 

0.98 

o  <100  beds 

0.64 

o  100-169  beds 

0.97 

o  170+  beds 

1.61 

New  England 

2.84 

Mid-Atlantic 

2.05 

South  Atlantic 

1.90 

East  North  Central 

2.10 

East  South  Central 

1.72 

West  North  Central 

1.48 

West  South  Central 

1.32 

MDuntain 

1.16 

Pacific 

1.33 

Major  Teaching 

3.39 

Other  Teaching 

2.46 

Non-Teaching 

1.34 

Nbt-for-Profit 

1.89 

Proprietary 

1.33 

Government 

1.47 

1.45 
0.92 
1.59 
1.39 
1.23 

0.60 
0.40 
0.68 
0.87 

0.59 
1.38 
1.45 
0.84 
1.22 
0.91 
1.02 
1.40 
1.92 

1.15 
1.26 
1.19 

1.15 
2.20 
0.86 


3.53 
1.85 
3.37 
3.98 
4.27 

1.58 
1.04 
1.65 
2.47 

3.42 
3.44 
3.35 
2.94 
2.95 
2.39 
2.34 
2.56 
3.25 

4.54 
3.71 
2.53 

3.04 
3.53 
2.33 


Source:  Health  Care  Financing  ?^dministration,  Office  of  Research  and 
Denonstrations . 


CHAPTER  5 
RESEARCH  FOR  IMPROVING  CASE  MIX  MEASUREMENT  UNDER  PPS 
This  chapter  describes  work  in  progress  on  refining  case  mix  measurement  and 
lays  out  refinements  which  may  become  possible  in  each  of  the  three  major  areas 
we  have  examined: 

—  improving  case  mix  measurement  at  the  individual  case  level, 

—  improving  the  payment  of  outlier  cases,  and 

—  improving  hospital-level  adjustments. 

I.   IMPROVING  MEASUREMENT  AT  THE  INDIVIDUAL  CASE  LEVEL 
We  have  pointed  out  that  there  is  no  single  tested,  feasible,  off-the-shelf 
solution  to  measuring  patient  treatment  needs  more  accurately.    Over  the  next  3 
years,  however,  a  series  of  refinements  show  promise  of  becoming  feasible: 

A.  REFINEMENTS  OF  THE  USE  OF  SECONDARY  DIAGNOSES  IN  THE  DRG 
SYSTEM.  Work  nearing  completion  at  Systemetrics  under  Jon  Conklin  and  work 
just  beginning  at  Yale  under  Robert  Fetter  promise  substantial  improvements  in 
the  use  of  secondary  diagnoses  to  identify  severely  ill  patients  within  the  DRG 
system.  HCFA  will  be  evaluating  the  Systemetrics  work  over  the  next  year  and 
expects  first  results  from  the  Yale  work  in  late  1987.       \ 

B.  EXTEND  RESEARCH  ON  USING  PHYSIOLOGIC  MEASUREMENTS  TO 
REFINE  DRGS.    HCFA  is  already  sponsoring  a  number  of  projects  in  this  area: 

1.  Health  Data  Institute  (Paul  Gertman  Principal  Investigator).  This  is  a 
multidimensional  study  of  the  use  of  physiologic  measures  and  other  data  to  predict 
costs,  including  an  integration  of  inpatient  and  outpatient  data  from  several 
sources. 

5-1 


2.  Tulane  School  of  Public  Health  (Farouk  Alemi  Principal  Investigator).  This 
is  a  study  of  seven  different  instruments  to  predict  the  cost  of  care  for  patients 
with  myocardial  infarction  in  two  hospitals  in  the  greater  New  Orleans  area. 

3.  The  Potential  of  MEDISGRPS  for  DRG  Refinement  (Mark  Moskowitz 
Principal  Investigator).  This  study  is  under  the  auspices  of  the  Brandeis  Health 
Policy  Center.  It  will  examine  existing  data  from  hospitals  subscribing  to  the 
MEDISGRPS  system  to  determine  how  much  payment  accuracy  MEDISGRPS  could 
add  to  DRGs. 

In  addition,  an  interesting  study  is  not  funded  by  HCFA: 

4.  Alternative  Severity  Measures  (William  Thomas  and  Marie  Ashcraft 
Principal  Investigators).  This  study  is  funded  by  a  consortium  of  university 
hospitals  and  the  State  of  Michigan.  It  compares  APACHE,  Computerized  Disease 
Staging,  Clinical  Disease  Staging,  Patient  Management  Categories,  and  DRGs  for 
reliability,  clinical  validity,  and  ability  to  predict  cost  and  length  of  stay. 
Preliminary  results  do  not  indicate  that  any  available  system  is  a  major 
improvement  over  DRGs. 

D.  COMPARATIVE  EVALUATION  OF  PHYSIOLOGIC  MEASURES  OF  SEVERITY 
OF  ILLNESS  AS  REFINEMENTS  TO  DRGs.  An  evaluation  of  these  measures  would 
compare  at  a  minimum,  APACHE,  Computerized  Severity  Index,  and  MEDISGRPS. 
Ideally,  the  study  would  also  test  a  number  of  techniques  identified  in  the  work 
described  in  Section  B.  The  study  would  use  a  nationally  representative  data  base 
and  would  measure  vulnerability  to  manipulation,  reliability,  and  costs  of  data 
collection.  The  study  would  assess  the  usefulness  of  these  methods  both  for 
payment  research  and  for  quality  of  care  assessment  (Chapter  3.VII  discusses 
design  and  methodology  in  somewhat  more  detail).  HCFA  is  currently  exploring 
the  design  of  this  study  as  well  as  its  feasibility. 
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E.  ANALYZE  DATA  COLLECTED  IN  PRO  QUALITY  ASSESSMENT  EFFORTS. 
Data  collected  by  PROs  to  assess  quality  of  care  will  generally  also  bear  on 
severity  of  patient  illness.  HCFA  is  currently  seeking  ways  to  maximize  the 
usefulness  of  PRO  data  for  studies  of  the  relation  between  costs  and  both  severity 
and  intensity.  In  addition,  PRO  data  may  be  usable  to  differentiate  reimbursable 
and  inappropriate  outlier  cases. 

F.  INTERNAL  HCFA  RESEARCH.  HCFA  will  continue  to  pursue  intramural 
studies  of  the  relation  between  beneficiary  characteristics  and  costs  at  the 
individual  DRG  level,  following  up  initial  findings  that  differences  are  small  when 
ORGS  are  aggregated  but  quite  significant  for  individual  DRGs. 

II.  IMPROVE  PAYMENT  OF  OUTLIER  CASES 
HCFA  is  analyzing  Medicare  cost  reports  for  the  first  year  of  PPS.  With  this 
information  it  will  be  possible  to  estimate  the  impact  of  the  current  outlier 
payment  strategy  more  precisely,  to  project  the  impact  of  paying  larger  amounts 
under  the  present  general  structure,  and  to  assess  what  changes  might  be  needed  in 
the  adjustments  for  teaching  and  urban/rural  location  to  reflect  the  fact  that  urban 
hospitals  and  teaching  hospitals  get  the  majority  of  outlier  payments.  This  work 
includes  careful  study  of  the  merits  of  focusing  outlier  payments  on  certain  DRGs 
and  of  the  applicability  of  physiologic  measures  to  outliers.  Studies  in  this  area 
wiU  include  securing  available  information  and  experience  from  Medicaid,  Blue 
Cross,  and  other  plan  which  are  using  DRGs  with  outlier  payment  policies  different 
from  those  of  Medicare. 

The  Rand-UCLA  Health  Policy  Research  Center  is  currently  analyzing  outlier 
payments  with  special  focus  on  policy  alternatives  which  do  not  require  legislative 
changes. 
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III.   IMPROVING  ADJUSTMENTS  AT  THE  HOSPITAL  LEVEL 
The  major  project  in  this  area  is: 

Boston  University  School  of  Medicine  (Lisa  lezzoni  Principal  Investigator). 
This  study  will  examine  the  differences  in  patient  populations  between  teaching 
and  nonteaching  hospitals  in  the  Boston  area. 

In  addition,  internal  HCFA  research  will  be  addressed  at  three  issues: 

—  Can  hospitals  be  characterized  effectively  by  their  sending  of  and  receipt  of 
transfer  patients? 

—  Can  HCFA  use  differences  in  cost  patterns  between  similar  DRGs  in 
teaching  and  nonteaching  hospitals  to  characterize  teaching  activity  more 
accurately  than  the  resident/bed  ratio? 

~  Does  the  distance  patients  travel  to  a  hospital,  compared  to  other  hospitals 
in  the  same  area,  provide  an  index  of  referral  center  status? 
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